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SPECIFICATION 



THE RICE RESTORER GENE TO THE RICE 
BT TYPE CYTOPLASMIC MALE STERILITY 



5 FIELD OF THE TNVRMTTDM 

The present invention relates to the rice restorer 
gene to the rice BT type cytoplasmic male sterility. 



The present application claims priority based on 
10 Japanese Patent Application No. 2002-107560 filed on July 
5, 2002. The entire disclosures of the patent application 
are incorporated herein. 



PRIOR ART 

Rice is a self -fertilizing plant, so in order to 
perform crossing between varieties, self-fertilization must 
first be avoided by removing all stamens in a glumaceous 
flower just before flowering and, then fertilization is 
effected with pollens from the parent variety with which it 
is to be crossed. However, this manual crossing method is 
entirely unsuitable for producing a large quantity of 
hybrid seeds on a commercial scale. 

Accordingly, hybrid rice is produced by the triple- 
crossing system which makes use of cytoplasmic male 
sterility. In the triple-crossing system, the following 
three lines are employed, i.e., a sterile line having male 
sterile cytoplasm, a restorer line having Rf-l gene and a 
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maintainer line having the same nuclear gene as that of the 
sterile line but not having any sterile cytoplasm. By 
using these three lines, (i) hybrid seeds can be obtained 
through fertilization of the sterile line with the pollen 
5 of the restorer line whereas (ii) the sterile line can be 
maintained through its fertilization with the pollen of the 
maintainer line. 

When employing the BT type male sterile cytoplasm in 
10 the triple-crossing system, it is important to breed rice 
of the restorer line and to this end, it is necessary to 
ensure that the rice at every stage of breeding maintains 
Rf-1 gene and that the Rf-1 gene is homozygous at the final 
stage. It also becomes necessary in the triple-crossing 
15 system to check to ensure that the variety used as the 
restorer line possesses Rf-1 gene, or to check for the 
presence of Rf-1 gene in order to ensure that the resulting 
hybrid seeds have restored fertility. 

20 In order to genotype the locus of Rf-1 gene in a 

plant, it has been necessary that Fl plants be first formed 
from hybrid seeds obtained by crossing the plant to be 
genotyped to a standard line and then self -fertilized, 
followed by investigating the incidence of individuals that 

25 can produce seeds at a frequency higher than a certain 

level (e.g. 70-80% or more). The standard line refers to 
the maintainer line, the sterile line or a set of the two 
lines, and it is appropriately chosen depending upon 



whether the cytoplasm of the individual under test is of BT 
type or normal type or unknown. If the standard line is a 
sterile line, it is crossed to the individual under test as 
the female parent and if the standard line is a maintainer 
5 line, it is crossed as the male parent. 

However, these techniques require a huge amount of 
labor and time to carry out. As a further problem, 
fertilization for seed production is sensitive to 

10 environmental factors and if an investigation is made in an 
unfavorable environment such as cold climate or 
insufficient daylight, sterility may be caused irrespective 
of the genotype constitution, with the result that 
genotyping of the locus of Rf-1 gene cannot be performed 

15 accurately. 

With a view to solving these problems, it has recently 
been proposed that Rf-1 gene be checked for its presence by 
a technique of molecular biology. The technical idea of 

20 this technique lies in checking for the presence or absence 
of Rf-1 gene by detecting base sequences linked to Rf-1 
gene (such sequences are hereunder referred to as DNA 
markers). Note that it is not possible to directly detect 
Rf-1 gene since the DNA sequence of Rf-1 gene has not been 

25 clarified so far. 



For example, it has been reported that the locus of 
Rf-1 gene in rice is present on chromosome 10 and located 
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between DNA marker (RFLP marker) loci G291 and G12 7 which 
can be used in restriction fragment length polymorphism 
analysis (RFLP) (Fukuta et al., 1992, Jpn J. Breed. 42 
(supl. 1) 164-165). This is a known method of genotyping 
5 the locus of Rf-1 gene by investigating the genotypes of 
DNA marker loci G2 91 and G12 7 which are linked to Rf-1 
gene . 

However, the conventional molecular biology techniques 
10 have several problems. First, they use RFLP markers which 
need to be detected by Southern blot analysis . In order to 
perform Southern blot analysis, DNA at the microgram level 
needs to be purified from the individual under test and, in 
addition, there is a need to carry out a sequence of steps 
15 comprising treatment with restriction enzymes, 

electrophoresis, blotting, hybridization with a probe and 
signal detection; this not only involves considerable labor 
but it also takes about one week to obtain the test 
results. 

20 

The second problem is that since the gene map distance 
between RFLP marker loci G291 and G127 is as long as about 
30 cM (corresponding to about 9000 kbp in rice DNA), the 
probability for the occurrence of double recombination in 
2 5 the region would be a few percent and hence, it is not 

always guaranteed that the genotype of the locus of Rf-1 
gene can be estimated correctly by the markers. 
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Thirdly, when the presence of Rf-1 gene is estimated 
by detecting RFLP marker loci G291 and G127, not only Rf-1 
gene but also the gene region between those loci are 
introduced into the fertility restorer line selected as the 
5 result of breeding. As a consequence, the introduced DNA 
sequence will have a chromosomal region of 30 cM or longer 
from the Rf-1 gene donor parent, and this presents the risk 
of introducing a deleterious gene that may potentially be 
present within that region. 

10 

In order to solve these problems, there have been 
developed a dominant DNA marker (Japanese Patent Public 
Disclosure No. 222588/1995) and a co-dominant DNA marker 
(Japanese Patent Public Disclosure No. 313187/1997), both 

15 of which are linked to the locus of Rf-1 gene. These 

markers are linked to the locus of Rf-1 gene, their genetic 
distances from Rf-1 gene respectively being 1.6 ± 0.7 cM 
(corresponding to about 480 kbp in rice DNA) and 3.7 ± 1.1 
cM (corresponding to about 1110 kbp in rice DNA), and their 

20 loci being on opposite sides of the locus of Rf-1 gene. 
Hence, the presence of Rf-1 gene can be estimated by 
detecting the presence of both the locus of the dominant 
PCR marker and that of the co- dominant PCR marker. The use 
of the co-dominant PCR marker also enables us to estimate 

25 as to whether the locus of Rf-1 gene is homozygous or 
heterozygous . 



However, the use of these PCR markers still involve 
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several problems. The co-dominant marker has a genetic 
distance of 3.7 ± 1.1 cM from the locus of Rf-1 gene, and 
the problem of potentially high frequency of recombination 
with the locus of Rf-1 gene has not been fully dissolved. 
5 As a result, speaking of the co-dominant marker itself, 
correct detection can be made as to whether it is 
homozygous or a heterozygous. However, if recombination 
occurs between the locus of the co- dominant marker and that 
of Rf-1 gene, the genotype of Rf-1 gene locus cannot be 

10 determined correctly, particularly as to whether it is 
homozygous or heterozygous. On the other hand, if the 
dominant marker is used to genotype the locus of Rf-1 gene, 
the marker will detect individuals indiscriminately 
irrespective of whether they are homozygous (Rf-1 /Rf-1) or 

15 heterozygous (Rf-l/rf-1) with respect to Rf-1 gene* 
Therefore, even if the co- dominant marker is used in 
combination with the dominant marker in order to genotype 
the locus of Rf-1 gene, it is not possible to correctly 
distinguish individuals having Rf-1 gene homozygously from 

20 those having the gene heterozygously . Further, if no 

amplification product is obtained in PCR using the dominant 
marker, one cannot deny the possibility that this is due to 
some problems in the experimental procedure. As a further 
problem, since the genetic distance between the co-dominant 

25 marker and the dominant marker is as great as about 5.3 cM 
(around 1590 kbp) , the size of the chromosomal region 
introduced from the Rf-1 gene donor parent cannot be 
limited to a sufficiently small value to prevent any 



.1 



concomitant introduction of a deleterious gene which may be 
contained in that region. 

Japanese Patent Public Disclosure No. 139465/2000 
5 describes co-dominant PCR markers that were developed on 
the basis of the base sequences of RFLP markers located in 
the neighborhood of Rf-1 gene on chromosome 10 of rice. 
However, most of those PCR markers are spaced from the Rf-1 
gene by a genetic distance greater than about 1 cM. 

10 

DISCLOSURE OF THE INVENTION 

An object of the present invention is to provide 

methods for restoring rice fertility. A method of the 

present invention comprises introducing a nucleic acid into 

15 rice, wherein the nucleic acid encodes the amino acid 

sequence of SEQ ID NO. 75, or an amino acid sequence which 
is identical to at least 70% of the amino acid sequence of 
SEQ ID NO. 75, and which functions to restore fertility. In 
one of the preferred embodiments of the present invention, 

20 the nucleic acid encoding the amino acid sequence of SEQ ID 
NO. 75, or an amino acid sequence which is identical to at 
least 70% of the amino acid sequence of SEQ ID NO. 75 is 
selected from nucleic acids of the following a) - p) : 

a) a nucleic acid comprising the bases 215-2587 of 
25 SEQ ID NO: 69; 

b) a nucleic acid comprising the bases 213-2585 of 
SEQ ID NO: 70; 

c) a nucleic acid comprising the bases 218-2590 of 
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SEQ ID NO: 71; 

d) a nucleic acid comprising the bases 208-2580 of 
SEQ ID NO: 72; 

e) a nucleic acid comprising the bases 149-2521 of 
5 SEQ ID NO: 73; 

f ) a nucleic acid comprising the bases 225-2597 of 
SEQ ID NO: 74; 

g) a nucleic acid comprising the bases 43907-46279 of 
SEQ ID NO: 27; 

10 h) a nucleic acid comprising the bases 229-2601 of 

SEQ ID NO: 80; 

i) a nucleic acid comprising the bases 175-2547 of 
SEQ ID NO: 81; 

j) a nucleic acid comprising the bases 227-2599 of 
15 SEQ ID NO: 82; 

k) a nucleic acid comprising the bases 220-2592 of 
SEQ ID NO: 83; 

1) a nucleic acid comprising the bases 174-2546 of 
SEQ ID NO: 84; 

20 m) a nucleic acid comprising the bases 90-2462 of SEQ 

ID NO: 85; 

n) a nucleic acid which is identical to at least 70% 
of the nucleic acid of any of a) - m) , and which functions 
to restore fertility; 
25 o) a nucleic acid which hybridizes to the nucleic 

acid of any of a) - m) under a moderate or high stringent 
condition, and which functions to restore fertility; and 

p) a nucleic acid wherein one or a plurality of 



base(s) is deleted from, added to or substituted from the 
nucleic acid of any of a) - m) , and which functions to 
restore fertility. 

5 Preferably, in the method of the present invention, 

the nucleic acid encoding the amino acid sequence of SEQ ID 
NO. 75, or an amino acid sequence which is identical to at 
least 70% of the amino acid sequence of SEQ ID NO. 75, and 
which functions to restore fertility, meets at least one of 
10 the following requirements 1) - 12): 

1) a base corresponding to the base 1769 of SEQ ID 
NO. 69 is A; 

2) a base corresponding to the base 1767 of SEQ ID 
NO. 70 is A; 

15 3) a base corresponding to the base 1772 of SEQ ID 

NO. 71 is A; 

4) a base corresponding to the base 1762 of SEQ ID 
NO. 72 is A; 

5) a base corresponding to the base 1703 of SEQ ID 
20 NO. 73 is A; 

6) a base corresponding to the base 1779 of SEQ ID 
NO. 74 is A; 

7) a base corresponding to the base 1783 of SEQ ID 
NO. 80 is A; 

25 8) a base corresponding to the base 1729 of SEQ ID 

NO. 81 is A; 

9) a base corresponding to the base 1781 of SEQ ID 
NO. 82 is A; 



10) a base corresponding to the base 1774 of SEQ ID 
NO. 83 is A; 

11) a base corresponding to the base 1728 of SEQ ID 
NO. 84 is A; or 

5 12) a base corresponding to the base 1644 of SEQ ID 

NO. 85 is A. 

Another object of the present invention is to provide 
a method for discerning whether a subject rice individual 

10 or a seed thereof has the Rf-1 gene or not, utilizing a 
nucleic acid encoding the amino acid sequence of SEQ ID 
NO. 75, or an amino acid sequence which is identical to at 
least 70% of the amino acid sequence of SEQ ID NO. 75, and 
which functions to restore fertility. Preferably, in an 

15 embodiment of the present method, the subject rice 

individual or the seed thereof is determined to have the 
Rf-1 gene, in the case that the nucleic acid encoding the 
amino acid sequence of SEQ ID NO. 75, or an amino acid 
sequence which is identical to at least 70% of the amino 

20 acid sequence of SEQ ID NO. 75, and which functions to 

restore fertility, meets at least one of the following 

requirements 1) - 12): 

1) a base corresponding to the base 1769 of SEQ ID 

NO. 69 is A; 

25 2) a base corresponding to the base 1767 of SEQ ID 

NO. 70 is A; 

3) a base corresponding to the base 1772 of SEQ ID 
NO. 71 is A; 
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4) a base corresponding to the base 1762 of SEQ ID 
NO. 72 is A; 

5) a base corresponding to the base 1703 of SEQ ID 
NO. 73 is A; 

5 6) a base corresponding to the base 1779 of SEQ ID 

NO. 74 is A? 

7) a base corresponding to the base 1783 of SEQ ID 
NO. 80 is A; 

8) a base corresponding to the base 1729 of SEQ ID 
10 NO. 81 is A; 

9) a base corresponding to the base 1781 of SEQ ID 
NO. 82 is A; 

10) a base corresponding to the base 1774 of SEQ ID 
NO. 83 is A; 

15 11) a base corresponding to the base 1728 of SEQ ID 

NO. 84 is A; or 

12) a base corresponding to the base 1644 of SEQ ID 
NO. 85 is A. 

20 Another object of the present invention is to provide 

a method for inhibiting the function of the Rf-1 gene to 
restore fertility. The inhibition method of the present 
invention comprises, in an embodiment, introducing an 
antisense having at least 100 bases in length, and being 

25 selected from base sequences complementary to a nucleic 

acid encoding the amino acid sequence of SEQ ID NO. 75, or 
an amino acid sequence which is identical to at least 70% 
of the amino acid sequence of SEQ ID NO. 75, and which 



functions to restore fertility. 



Still another object of the present invention is to 
provide a nucleic acid encoding the amino acid sequence of 
5 SEQ ID NO. 75, or an amino acid sequence which is identical 
to at least 70% of the amino acid sequence of SEQ ID N0.75 # 
and which functions to restore fertility. The present 
invention provides, in an embodiment, a nucleic acid 
selected from nucleic acids of the following a) - p) : 
10 a) a nucleic acid comprising the bases 215-2587 of 

SEQ ID NO: 69; 

b) a nucleic acid comprising the bases 213-2585 of 
SEQ ID NO: 70; 

c) a nucleic acid comprising the bases 218-2590 of 
15 SEQ ID NO: 71; 

d) a nucleic acid comprising the bases 208-2580 of 
SEQ ID NO: 72; 

e) a nucleic acid comprising the bases 149-2521 of 
SEQ ID NO: 73; 

20 f ) a nucleic acid comprising the bases 225-2597 of 

SEQ ID NO: 74; 

g) a nucleic acid comprising the bases 43907-46279 of 
SEQ ID NO: 27; 

h) a nucleic acid comprising the bases 229-2601 of 
25 SEQ ID NO: 80; 

i) a nucleic acid comprising the bases 175-2547 of 
SEQ ID NO:81; 

3) a nucleic acid comprising the bases 227-2599 of 



SEQ ID NO: 82; 

k) a nucleic acid comprising the bases 220-2592 of 
SEQ ID NO: 83; 

1) a nucleic acid comprising the bases 174-2546 of 
5 SEQ ID NO: 84; 

m) a nucleic acid comprising the bases 90-2462 of SEQ 
ID NO: 85; 

n) a nucleic acid which is identical to at least 70% 
of the nucleic acid of any of a) - m) , and which functions 
10 to restore fertility; 

o) a nucleic acid which hybridizes to the nucleic 
acid of any of a) - m) under a moderate or high stringent 
condition, and which functions to restore fertility; and 

p) a nucleic acid wherein one or a plurality of 
15 base(s) is deleted from, added to or substituted from the 
nucleic acid of any of a) - m) , and which functions to 
restore fertility, 

BRIEF DESCRIPTION OF THE DRAWING 

20 Fig j: 1_ shows the results of chromosomal walking 

started from the RFLP marker locus S12564. 

Fig_. 2^ shows an alignment of lambda clone contigs in 

relation to the BAC clone AC068923. 

Fig. 3^ shows the chromosomal organization of 

25 recombinant pollens proximal to the Rf-1 locus (all 

fertile) as mapped in close proximity to the Rf-1 locus 
based on the genotypes at the marker loci of Id individuals 
(RSI, RS2 , RC1-8) generated from the pollens. White bars 
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represent japonica regions and black bars represent indica 
regions . 

Fig* 4 is a gene map in which the locus of Rf-1 gene 
on chromosome 10 of rice is positioned on a linkage map in 
5 relation to various markers; the values of map distance 
were calculated from the segregation data from 1042 Fl 
individuals . 

Fig. 5 shows fragments from 10 genomic clones used for 
— 

the identification of the Rf-1 region by complementation 
10 assays. Lambda clones obtained by chromosomal walking 

(thin lines) were used for complementation assays of the 

chromosomal regions shown by bold lines. XSF18 was found 

to contain a deletion shown by dotted line. 

Fig. 6 shows the results of complementation assays 
15 using a 15.7 kb fragment from XSG16 (Example 10) and a 16.2 

kb fragment from XSF18 (Example 8) . The plant transformed 

with the 15.7 kb fragment from XSG16 has restored fertility 

as proved by ears bowing. 

Fig. 7 is a schematic picture showing the Rf-1 gene 
20 structure. White bars and black bars represent exons and 

introns, respectively. Numbers of base pairs are shown for 

the exon portions . 

F ig. 8 i s a schematic picture showing positional 

relationships between the IR24 genome fragment subjected to 
25 the complementation assays, probes used for the cDNA 

library screening and the Rf-1 gene deduced from the 

isolated cDNAs. White bars and black bars in the Rf-1 gene 

represent exons and introns, respectively. Numbers of base 
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pairs are shown for the exon portions. 

BEST MODES FOR PERFORMING THE INVENTION 

We began by restricting the Rf-1 locus to a very small 
5 region on chromosome 10. On this basis, we developed PCR 
markers proximal to the Rf-1 locus and found a method for 
detecting the Rf-1 gene by utilizing on the linkage of 
these PCR markers to the Rf-1 locus. Specifically, the 
presence of the Rf-1 gene is tested and individuals 

10 homozygous for the Rf-1 gene are selected by genotyping at 
the novel PCR marker loci proximal to the Rf-1 locus on the 
basis that the Rf-1 locus is mapped between the PCR marker 
loci S12564 Tsp509I and C1361 Mwol on chromosome 10 of 
rice. We previously filed a patent application for the 

15 method for detecting the Rf-1 gene under Japanese Patent 

Application No. 2000-247204 on August 17, 2000. The entire 
disclosure of the patent application is incorporated herein 
by reference. 

20 I- Methods for estimating the genotype at the Rf-1 locus 
described in Japanese Patent Application No, 2QQQ-247204 

Japanese Patent Application No. 2000-247204 describes 
methods for determining whether or not a rice individual or 
seed under test has the Rf-1 gene on the basis that the Rf- 

25 1 locus is mapped between the PCR marker loci S12564 and 
C1361 on chromosome 10 of rice. 

Markers 
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Primer pairs designed to be specific to particular 
regions near the locus of Rf-1 gene are used in PCR and the 
amplification products are treated with particular 
restriction enzymes; upon electrophoresis, rice of indica 
5 lines in some cases provide an observable band of a 

different size from that of rice of Japonica lines* This 
band which is characteristic of indica lines is herein 
referred to as the Rf-1 linked band. Now that it has been 
made clear by the present inventors that the locus of Rf-1 
10 gene is located between PCR markers S12564 Tsp509I and 

C1361 Mwol on chromosome 10 of rice, the skilled artisan 
can appropriately develop and employ PCR markers that are 
present in the neighborhood of Rf-1 gene. 

15 For instance, according to the invention, a rice 

individual under test is checked to see if its genome 
contains at least one of the PCR markers listed below, 
thereby determining whether the individual under test has 
Rf-1 gene linked to those PCR markers: 

20 (1) marker 1: PCR marker R1877 EcoRI which, when rice 

genomic DNA is subjected to PCR with DNA primers having the 
sequences of SEQ ID NO:l and SEQ ID NO: 2, can detect 
polymorphisms between rice individuals of the japonica and 
indica lines depending on whether the amplification 

25 products have a recognition site for restriction enzyme 
EcoRI; 

(2) marker 2: PCR marker G4003 Hindlll (SEQ ID NO:19) 
which, when rice genomic DNA is subjected to PCR with DNA 
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primers having the sequences of SEQ ID NO: 3 and SEQ ID 
NO:4 # can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the 
amplification products have a recognition site for 
5 restriction enzyme Hindlll; 

(3) marker 3: PCR marker C1361 Mwol (SEQ ID NO: 20) 
which, when rice genomic DNA is subjected to PCR employing 
DNA primers having the sequences of SEQ ID NO: 5 and SEQ ID 
NO: 6, can detect polymorphisms between rice individuals of 

10 the japonica and indica lines depending on whether the 
amplification products have a recognition site for 
restriction enzyme Mwol; 

(4) marker 4: PCR marker G2155 Mwol (SEQ ID NO: 21) 
which, when rice genomic DNA is subjected to PCR with DNA 

15 primers having the sequences of SEQ ID NO: 7 and SEQ ID 

NO: 8, can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the 
amplification products have a recognition site for 
restriction enzyme Mwol; 

20 (5) marker 5: PCR marker G291 Mspl (SEQ ID NO: 22) 

which, when rice genomic DNA is subjected to PCR with DNA 
primers having the sequences of SEQ ID NO: 9 and SEQ ID 
NO: 10, can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the 

25 amplification products have a recognition site for 
restriction enzyme Mspl; 

(6) marker 6: PCR marker R2303 BslI (SEQ ID NO: 23) 
which, when rice genomic DNA is subjected to PCR with DNA 
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primers having the sequences of SEQ ID NO: 11 and SEQ ID 
NO: 12, can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the 
amplification products have a recognition site for 
5 restriction enzyme BslI; 

(7) marker 7: PCR marker S10019 BstUI (SEQ ID NO: 24) 
which, when rice genomic DNA is subjected to PCR with DNA 
primers having the sequences of SEQ ID NO: 13 and SEQ ID 
NO: 14, can detect polymorphisms between rice individuals of 

10 the japonica and indica lines depending on whether the 
amplification products have a recognition site for 
restriction enzyme BstUI; 

(8) marker 8: PCR marker S10 602 Kpnl (SEQ ID NO: 25) 
which, when rice genomic DNA is subjected to PCR with DNA 

15 primers having the sequences of SEQ ID NO: 15 and SEQ ID 

NO: 16, can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the 
amplification products have a recognition site for 
restriction enzyme Kpnl; and 

20 (9) marker 9: PCR marker S12564 Tsp509I (SEQ ID NO: 26) 

which, when rice genomic DNA is subjected to PCR with DNA 
primers having the sequences of SEQ ID NO: 17 and SEQ ID 
NO: 18, can detect polymorphisms between rice individuals of 
the japonica and indica lines depending on whether the 

25 amplification products have a recognition site for 
restriction enzyme Tsp509I. 



Assuming that the locus of Rf-1 gene was highly likely 
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to be located near the nine RFLP marker regions R1877, 
G291, R2303, S12564, C1361, S10019, G4003, S10602 and G2155 
on chromosome 10 of rice (see the results of RFLP linkage 
analysis described in Fukuta et al., 1992, Jpn. J. Breed. 
42 (supl. 1) 164-165 and the RFLP linkage map of rice 
described in Harushima et al. , 1998, Genetics, 148, 479- 
494), the present inventors converted those RFLP markers to 
co-dominant PCR markers such as CAPS markers or dCAPS 
markers as described below in Reference example 1 (Michaels 
and Amasino, 1998, The Plant Journal, 14(3), 381-385; Neff 
et al., 1998, The Plant Journal, 14(3), 387-392). As a 
result of this conversion, the PCR markers above have been 
obtained. 

Among these PCR markers, one group consisting of PCR 
markers R1877 EcoRI , G291 Mspl (SEQ ID NO:22), R2303 BslI 
(SEQ ID NO: 23) and S12564 Tsp509I (SEQ ID NO: 26) and the 
other group consisting of PCR markers C1361 Mwol (SEQ ID 
NO:20), S10019 BstUI (SEQ ID NO:24), G4003 Hindlll (SEQ ID 
NO: 19), S10602 Kpnl (SEQ ID NO:25) and G2155 Mwol (SEQ ID 
NO: 21) are on opposite sides of the locus of Rf-1 gene on 
chromosome 10 of rice. 

Therefore, in one embodiment, the presence of the Rf-1 
gene is detected by detecting Rf-1 linked bands by (a) at 
least one PCR marker selected from the group consisting of 
PCR markers R1877 EcoRI, G291 Mspl, R2303 BslI and S12564 
Tsp509I, and (b) at least one PCR marker selected from the 
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group consisting of PCR markers C1361 Mwol, S10019 BstUI, 
G4003 Hindlll, S10602 Kpnl and G2155 Mwol . In this case, 
at least S12564 Tsp509I from group (a) and at least C1361 
Mwol from group (b) are preferably used as the closest PCR 
5 markers to the Rf-1 gene. If Rf-1 linked bands are 

detected with PCR markers of both (a) and (b) in the genome 
of the rice under test, it can be estimated with a high 
probability that the rice contains Rf-1 gene. 

10 In another embodiment, Rf-1 linked bands are detected 

by at least two PCR markers of group (a) and at least two 
PCR markers of group (b) above. For example, a rice 
individual carrying the Rf-1 gene with a minimum of 
unwanted gene regions can be selected by picking up an 

15 individual in which Rf-1 linked bands are detected by 

markers of groups (a) and (b) more proximal to the Rf-1 
gene but not detected by markers of groups (a) and (b) more 
distal from the Rf-1 gene on the gene map shown in Fig. 1. 
Again, it is preferred that at least one PCR marker of 

20 group (a) is S12564 Tsp509I and at least one PCR marker of 
group (b) is C1361 Mwol. Thus, the two PCR marker loci 
S12564 Tsp509I and C1361 Mwol are separated by a genetic 
distance of 0.3 cM. By utilizing this characteristic, the 
chromosomal region that is introduced from the Rf-1 gene 

25 donor parent can be narrowed down to a size of about 1 cM. 
This helps minimize the possibility of introducing into the 
restorer line a deleterious gene that may be present in the 
neighborhood of Rf-1 gene in the donor parent. 
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p»t^ inn of the Rf-l gene 

In order to detect Rf-1 gene in the genome of a rice 
under test, any one of the above PCR markers is amplified 
from the genome of the rice by PCR using primers of SEQ ID 

5 NOS: 1-18 above and then detected by the polymerase chain 
reaction -restriction fragment length polymorphism method 
(PCR-RFLP). PCR-RFLP is a method that is applicable to the 
case where polymorphisms exist among variety lines at 
recognition sites of restriction enzymes in the sequences 

10 of PCR amplified DNA fragments and by which specific 

polymorphisms can conveniently be identified on the basis 
of cleavage patterns with those restriction enzymes (D.E. 
Harry et al.. Theor. Appl. Genet. (1998), 97:327-336) 



shown in Table 1 below on a visualized gel depending on the 
primer pair used. 



15 



Restriction enzyme cleavage patterns show the bands as 



Table 1 



20 



Approximate size 



^■p\ of flftl-finted band 



Detection of marker 1 (R1877 EcoRI ) 



with primer pair 1 



When the genome of test rice has 



25 



Rf-1 gene homozygously 



1500 and 1700 



When the genome of test rice has 



Rf-1 gene heterozygously 



1500, 1700 and 



3200 
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10 



15 



When the genome of test rice has 
no Rf-1 gene 



Detection of marker 3 (C1361 Mwol) 

with primer pair 3 

When the genome of test rice has 

Rf-1 gene homozygously 

When the genome of test rice has 

Rf-1 gene heterozygously 

When the genome of test rice has 

no Rf-1 gene 



Detection of marker 4 (G2155 Mwol) 
20 with primer pair 4 

When the genome of test rice has 

Rf-1 gene homozygously 

When the genome of test rice has 

Rf-1 gene heterozygously 
25 When the genome of test rice has 

no Rf-1 gene 



Detection of marker 5 (G291 Mspl) 
with primer pair 5 
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Detection of marker 2 (G4003 Hindlll) 
with primer pair 2 

When the genome of test rice has 

Rf-1 gene homozygously 

When the genome of test rice has 

Rf-1 gene heterozygously 

When the genome of test rice has 
no Rf-1 gene 



362 



95, 267 and 362 



; 9 5 and 267 



50 and 107 



25, 50, 79 and 107 



: 25, 50 a nd 79 



: 25, 27 and 78 



25, 27, 78 and 105 



; 2 5 a nd 105 



t 



When the genome of test rice has 

Rf-1 gene homozygously : 25, 4 9 and 55 

When the genome of test rice has 

Rf-1 gene heterozygously : 25 , 49, 55 and 104 

5 When the genome of test rice has 

no Rf-1 aene ; 25 and 104 

Detection of marker 6 (R2303 BslI) 
with primer pair 6 

When the genome of test rice has 
10 Rf-1 gene homozygously : 238, 655 and 679 

When the genome of test rice has 

Rf-1 gene heterozygously : 238, 655, 679 and 

1334 

When the genome of test rice has 

15 no Rf-1 gene : 238 and 1334 

Detection of marker 7 (S10019 BstUI) 
with primer pair 7 

When the genome of test rice has 

Rf-1 gene homozygously : 130, 218 and 244 

20 When the genome of test rice has 

Rf-1 gene heterozygously : 130, 218, 244 and 

462 

When the genome of test rice has 

no Rf-1 aene ; 130 and 462 

25 Detection of marker 8 (S10602 Kpnl) 
with primer pair 8 

When the genome of test rice has 
Rf-1 gene homozygously : 724 
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When the genome of test rice has 

Rf-1 gene heterozygously : 117, 607 and 724 

When the genome of test rice has 

no Rf-1 gene : 117 and 6Q7 

Detection of marker 9 (S12564 Tsp509I) 
with primer pair 9 

When the genome of test rice has 

Rf-1 gene homozygous ly : 41 and 117 

When the genome of test rice has 

Rf-1 gene heterozygously : 26, 41, 91 and 117 

When the genome of test rice has 

no Rf-1 aene . ; 26, 41 and 91 

TT. Iden tification of the Rf-1 locus 

As described above, Japanese Patent Application No. 
2000-247204 discloses RFLP-PCR markers based on our finding 
that the Rf-1 locus is mapped between DNA marker loci 
S12564 Tsp509I and C1361 Mwol . Fertility-restoring lines 
are established by backcrossing the Rf-1 gene into a normal 
japonica variety not containing the Rf-1 gene. If the 
method for identifying the Rf-1 locus described in Japanese 
Patent Application No. 2000-247204 is used during this 
process, not only the restoring lines can be established 
efficiently (within 2-3 years) but also the length of 
insert fragments can be controlled. 

However, introduction by crossing inevitably introduce 
regions proximal to Rf-1 at the same time. Japanese Patent 
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Application No. 2000-247204 showed that the Rf-1 locus is 
mapped between DNA marker loci S12564 Tsp509I and C1361 
Mwol, but the distance between both loci is about 0.3 cM, 
i.e. about 90 kbp. If a deleterious gene existed proximal 
5 to Rf-1, it would be undeniable that the deleterious gene 
might be inserted together with the Rf-1 gene. 

Thus, we searched for regions linked to the Rf-1 gene 
between DNA marker loci S12564 Tsp509I and C1361 Mwol by 
chromosomal walking and genetic analysis based on the close 
linkage between the Rf-1 locus and the DNA marker locus 
S12564 Tsp509I. As a result, we successfully identified 
the region of the Rf-1 locus including the Rf-1 gene upto 
about 76 kb and determined the entire base sequence of said 
region. According to the present invention, it is possible 
to introduce the function of a fertility restorer gene into 
BT male sterile cytoplasms by genetic engineering 
techniques . 

20 Specifically, in Japanese Patent Application No. 

2000-247204, linkage analyses on a population of 1042 
individuals prepared by pollinating MS Koshihikari with 
MS-FR Koshihikari (heterozygous at the Rf-1 locus) revealed 
one recombinant between the Rf-1 and S12564 Tsp509I loci 

25 and two recombinants between the Rf-1 and C1361 Mwol loci 

(Reference examples 1-2 herein). In the present invention, 
4103 individuals were added to the population to analyze a 
total of 5145 individuals. As a result, one recombinant 
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between the Rf-1 and S12564 Tsp509I loci and six 
recombinants between the Rf-1 and C1361 Mwol loci were 
newly found with a total of 2 and 8 recombinants. These 10 
individuals were tested by the high-precision segregation 
5 analysis of the present invention as recombinants proximal 
to the Rf-1 locus (Example 1). 

The frequency of 8 recombinants between the Rf-1 and 
C1361 Mwol loci as compared with 2 recombinants between the 

10 Rf-1 and S12564 Tsp509I loci means that the S12564 Tsp509I 
locus is genetically closer to the Rf-1 locus than the 
C1361 Mwol locus. Genetic distance (expressed in 
recombination frequency: cM) and physical distance 
(expressed in the number of base pairs: bp) are not always 

15 proportional to each other, but it can be normally expected 
that physical distance decreases with genetic distance. 

Thus, we tried to isolate the Rf-1 locus by 
chromosomal walking started from the S12564 Tsp509I locus 

20 (Example 2). Chromosomal walking was performed on a 

genomic library prepared from K DASH II vector using the 
genomic DNA of an indica variety IR24 and a japonica 
variety Asominori. IR24 is a variety carrying Rf-1, while 
Asominori is a variety not carrying Rf-1. As a result of 

2 5 chromosomal walking, contigs covering a chromosomal region 
of about 76 kb (ordered sets of overlapping clones on a 
chromosome) were able to be prepared from genomic clones of 
IR24, and the entire base sequence (76363 bp) thereof was 
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determined. 

Then, 12 markers were newly developed on the basis of 
the base sequence data or the like obtained and a high- 
precision segregation analysis was performed on the 10 
recombinants proximal to Rf-1 locus described above 
(Example 3). As a result, a 65 kb sequence included in the 
chromosomal region of about 76 kb above was shown to 
contain a sequence determining the presence of the function 
of the Rf-1 gene. This region is covered by a contig 
consisting of 8 genomic clones. Each clone has a length of 
about 12-22 kb and has overlapping domains of at least 
4.7 kb . Genes for rice are known to have a wide range of 
lengths (from short ones to large ones), but most of them 
seem to have a length of several kb or less. Thus, at 
least one of these 8 genomic clones is expected to contain 
the full-length Rf-1 gene. 

We further restricted the Rf-1 gene region in the 
20 chromosomal region of about 7 6 kb above and performed 

complementation assays to directly demonstrate the presence 
of a fertility restoring ability. 

Specifically, 10 partial fragments (each 10-21 kb) in 
25 the above region of 76 kb were separately introduced into 
immature seeds of a male sterility line MS Koshihikari by 
genetic engineering techniques (Fig. 5). Of the 10 partial 
fragments used, 8 fragments are derived from 8 genomic 
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clones previously obtained by chromosomal walking (XSE1 # 
XSE7 , XSF4, XSF20, XSG22 , XSG16 , XSG8 and XSH18 shown in 
Fig. 1 and described in Example 3). Additionally, 
fragments derived from 2 clones XSF18 and XSX1 were also 
5 analyzed by complementation assays. XSF18 is identical to 
XSF20 at the 5' and 3' ends (bases 20328 and 41921 of SEQ 
ID NO: 27, respectively), but lacks internal bases 33947- 
38591. This is because clone XSF18 was initially isolated 
but found to contain the above deletion during 

10 amplification after isolation, and therefore, the 

amplification step was freshly taken to isolate a complete 
clone designated XSF20 (Example 8). XSX1 is a clone 
freshly prepared from clones XSG8 and XSH18 by restriction 
enzyme treatment and ligation to contain sufficient 

15 overlapping domains because of the overlapping domains of 

both clones are relatively small (about 7 kb) (Example 13). 

If the insert fragment completely contains the Rf-1 
gene, transformed individuals at this generation restore 

20 fertility because Rf-1 is a dominant gene. In 

complementation assays plants transformed with each 
fragment were evaluated for seed fertility to find that 
those transformed with a 15.6 kb fragment (including bases 
38538-54123 of SEQ ID NO: 27) derived from the X phage clone 

25 XSG16 restored seed fertility (Example 10). Plants 

transformed with the other fragments were all sterile. 
These results showed that the above 15.6 kb fragment 
completely contains the Rf-1 gene. Moreover, a method for 
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introducing the Rf-1 gene by genetic engineering techniques 
was provided by the present invention and demonstrated to 
be effective. 

5 To further specify the region of the X phage clone 

XSG16 in which the Rf-1 gene is contained, we evaluated 
seed fertility of shorter fragments than the 15.6 kb 
fragment (including bases 38538-54123 of SEQ ID NO: 27) by 
complementation assays. As a result, plants transformed 

10 with a 11.4 kb fragment derived from XSG16 (including bases 
42357-53743 of SEQ ID NO:27) were shown to restore seed 
fertility (Example 10(2)). Plants transformed with a 
further shorter 6.8 kb fragment (including bases 42132- 
48883 of SEQ ID NO: 27) also restored seed fertility 

15 (Example 10(3)). These results showed that the above 
6.8 kb fragment contains the Rf-1 gene. 

The present inventors further continued studying, and 
identified the nucleic acid having the function to restore 

20 fertility. The amino acid sequence encoded by the nucleic 
acid then has been clarified. Specifically, DNA fragments 
corresponding to bases 43733-44038 and 48306-50226 of SEQ 
ID NO: 27 were first prepared by using PCR as described in 
Examples 14-15. The cDNA library prepared from the line 

25 wherein Rf-1 is introduced to Koshihikari was screened by 
using the above two DNA fragments as probes (Probe P and 
Q) . As a result, terminal base sequences of 6 clones are 
identical to the sequence of XSG16, and these 6 clones were 
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isolated as those containing the Rf-1 gene, and base 
sequences thereof were analyzed (SEQ ID NOS:69-74). 

All of the sequences, SEQ ID NOS:69-74 encode a 
5 protein having the amino acids 1-791 of SEQ ID NO: 75. 

Specifically, all and each of the 215-2587 of SEQ ID NO: 69, 
the bases 213-2585 of SEQ ID NO:70, the bases 218-2590 of 
SEQ ID NO: 71, the bases 208-2580 of SEQ ID NO: 72, the bases 
149-2521 of SEQ ID NO: 73 and the bases 225-2597 of SEQ ID 
10 NO: 74 encodes a protein having amino acids 1-791 of SEQ ID 
NO: 75. The above base sequences correspond to the bases 
43907-46279 of SEQ ID NO:27. 

The amino acid sequence of SEQ ID NO: 75 was compared 
15 with the presumed amino acid sequence of the corn fertility 
restorer gene (Rf2), and the N-terminal 7 amino acid 
residues (Met-Ala-Arg-Arg-Ala-Ala-Ser ) in both amino acid 
sequences were concurred. These 7 amino acid residues are 
considered to be a portion of a targeting signal to 
20 mitochondria (Liu et al., 2001). Based on the above facts, 
the cDNAs isolated on this occasion are considered to 
contain the full coding region of the Rf-1 gene. No 
homology between the amino acid sequences of the rice Rf-1 
and the corn Rf-2 can be found except for the above region. 

25 

In addition, the sequences of cDNAs isolated on this 
occasion were compared with the genome sequence of IR24 
(SEQ ID NO: 27), and the structures of exons and introns of 
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the Rf-1 gene were clarified (Fig. 7). As a result, it was 
shown that various transcription products wherein the 
splicing patterns and the poly A addition positions are 
different, are present in a plant body. There is no intron 
in the coding region of the Rf-1 gene. 

As for the 6.8 kb fragment which restored seed 
fertility in the complementary assay of Example 10 (3), the 
present inventors further pursued a complementary assay. 
Specifically, in Example 16, a 4 . 2 kb fragment (the bases 
42132-46318 of SEQ ID NO: 27) containing the promoter region 
and the presumed translation region of the Rf-1 gene 
within the above 6 . 8 kb fragment was subjected to a 
complementary assay, and the 4 . 2 kb fragment restored the 
seed fertility. 

Further, in Example 17, six new clones containing the 
nucleic acid having the fertility restorer function were 
obtained. Specifically, PCR was performed by using two 
primers corresponding to the bases 45522-45545 and 45955- 
45932 of SEQ ID NO: 27, and the genomic clone XSG16 of IR24 
as a template to obtain a DNA fragment. Plaque 
hybridization assays were performed by using the DNA 
fragment as Probe R and the above mentioned Probe P. Six 
clones were newly obtained (#7 - #12) from plaques which 
are positive for both Probe P and Probe R. The results 
were shown in SEQ ID NOS: 80-85. 
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All of the sequences. SEQ ID NOS:80-85 are presumed to 
encode a protein having the amino acids 1-791 of SEQ ID 
NO: 75. Specifically, all and each of the 229-2601 of SEQ 
ID NO: 80, the bases 175-2547 of SEQ ID NO: 81, the bases 
227-2599 of SEQ ID NO: 82, the bases 220-2592 of SEQ ID 
NO: 83, the bases 174-2546 of SEQ ID NO: 84 and the bases 
90-2462 of SEQ ID NO: 85 encodes a protein having amino 
acids 1-791 of SEQ ID NO: 75. The above base sequences 
correspond to the bases 43907-46279 of SEQ ID NO: 27. 

The sequences of cDNAs isolated on this occasion were 
compared with the genome sequence of IR24 (The Japanese 
Patent Application No. 2001-285247, SEQ ID NO: 27), and the 
structures of exons and introns were clarified (Fig.8). 
Among the cDNAs isolated on this occasion, there are three 
cDNAs which do not have any exons irrelevant to the 
presumed translation region, and consist of a single exon 
(#10 - #12, SEA ID NOS: 83-85). 

TTT. NuHp.Ic ac -trifi nontai n ina tne Rf-1 lOCUS 

The present invention provides nucleic acids 
containing the locus of a fertility restorer gene (Rf-1). 
The nucleic acids containing the locus of a fertility 
restorer gene (Rf-1) of the present invention include a 
nucleic acid having the base sequence of SEQ ID NO. 27, or 
nucleic acid having a base sequence which is identical to 
at least 70% of the base sequence of SEQ ID NO. 27, and 
which functions to restore fertility. Further, as 
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described in Example 10, it was confirmed that the Rf-1 
gene is completely contained in especially the bases 38538- 
54123 of the base sequence of SEQ ID NO: 27. Still further, 
the region containing the Rf-1 gene is determined to be, 
preferably the bases 38538-54123 of SEQ ID NO: 27, more 
preferably the bases 42357-53743, still preferably the 
bases 42132-48883, and still more preferably the bases 
42132-46318. 

The present inventors further pursued the study, and 
determined that the following regions as being nucleic 

acids containing the Rf-1 gene. 

a) the bases 215-2587 of SEQ ID NO: 69; 

b) the bases 213-2585 of SEQ ID NO: 70; 

c) the bases 218-2590 of SEQ ID NO: 71; 

d) the bases 208-2580 of SEQ ID NO: 72; 

e) the bases 149-2521 of SEQ ID NO: 73; 

f) the bases 225-2597 of SEQ ID NO:74; 

h) the bases 229-2601 of SEQ ID NO: 80; 

i) the bases 175-2547 of SEQ ID NO: 81; 
j) the bases 227-2599 of SEQ ID NO: 82; 
k) the bases 220-2592 of SEQ ID NO: 83; 

1) the bases 174-2546 of SEQ ID NO: 84; and 

m) the bases 90-2462 of SEQ ID NO: 85. 

The above base sequences correspond to g) the bases 
43907-46279 of SEQ ID NO: 27, and all of the bases encode 
the amino acid sequence 1-791 of SEQ ID NO:75. 
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Hereinafter, in the present specification, the term 
"the base sequence of SEQ ID NO: 27" refers to the whole SEQ 
ID NO: 27 or a portion thereof which takes part in the 
fertility restorer function, especially the bases 38538- 
54123. The term refers to more preferably the bases 42357- 
53743, still preferably the bases 42132-48883, and still 
more preferably the bases 42132-46318. And most 
preferably, it refers to g) the bases 43907-46279 of SEQ ID 
NO:27, or alternatively, a) the bases 215-2587 of SEQ ID 
NO:69, b) the bases 213-2585 of SEQ ID NO:70, c) the 
bases 218-2590 of SEQ ID NO: 71, d) the bases 208-2580 of 
SEQ ID NO:72, e) the bases 149-2521 of SEQ ID NO:73, f) 
the bases 225-2597 of SEQ ID NO: 74, h) the bases 229-2601 
of SEQ ID,NO:80, i) the bases 175-2547 of SEQ ID NO:81, j) 
the bases 227-2599 of SEQ ID NO:82, k) the bases 220-2592 
of SEQ ID NO: 83, 1) the bases 174-2546 of SEQ ID NO: 84 or 
m) the bases 90-2462 of SEQ ID NO: 85 corresponding 
thereto . 

In the examples below, a nucleic acid was isolated 
from a genomic library of indica rice IR24 containing the 
Rf-1 gene as a nucleic acid containing a fertility restorer 
gene (Rf-1) and determined to have the base sequence of SEQ 
ID NO: 27. However, the nucleic acid containing a fertility 
restorer gene (Rf-1) of the present invention can be 
derived from any indica variety carrying the Rf-1 gene. 
The indica varieties carrying the Rf-1 gene include, but 
not specifically limited to, e.g. IR24, IR8, IR36, IR64, 
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Chinsurah and BoroII. Known japonica varieties not 
carrying the Rf-1 gene include, but not limited to, 
Asominori, Koshihikari, Kirara 397, Akihikari, 
Akitakomachi , Sasanishiki, Kinuhikari, Nipponbare, 
Hatsuboshi, Koganebare, Hinohikari, Mineasahi, 
Aichinokaori , Hatsushimo, Akebono, Fujihikari, 
Minenoyukimochi , Kokonoemochi , Fukuhibiki , Dontokoi , 
Gohyakumangoku , Hanaechizen , Todorokiwase , Haenuki , 
Domannaka, Yamakikari, etc. The "indica" and "japonica" 
varieties are well known to those skilled in the art and 
the rice varieties encompassed by the present invention can 
be readily determined by those skilled in the art. 

Nucleic acids of the present invention include DNA in 
both single- stranded and double-stranded forms, as well as 
the RNA complement thereof. DNA includes, for example, 
genomic DNA (including corresponding cDNA) , chemically 
synthesized DNA, DNA amplified by PCR, and combinations 
thereof . 

Nucleic acids containing the Rf-1 gene of the present 
invention preferably have the base sequence of SEQ ID 
NO: 27. More than one codon may encode the same amino acid, 
and this is called degeneracy of the genetic code. Thus, a 
DNA sequence not completely identical to SEQ ID NO: 27 may 
encode a protein having an amino acid sequence completely 
identical to SEQ ID NO: 27. Such a variant DNA sequence may 
result from silent mutation (e.g., occurring during PCR 



amplification), or can be a product of deliberate 
mutagenesis of a native sequence. 

Preferably, the Rf-1 gene of the present invention 
encodes the amino acid sequence described in SEQ ID NO: 75. 
However, it is not limited thereto, and may encode an amino 
acid sequence wherein one or more amino acid residues are 
deleted, added or substituted. 

The protein of the present invention is intended to 
include any homologous proteins as long as they have the 
fertility restorer function. The "amino acid variation" 
occurs at one or a plurality of amino acids residues, 
preferably 1-20, more preferably 1-10, most preferably 1-5 
amino acid residues. The amino acid sequence encoded by 
the Rf-1 gene has an identity of at least about 70%, 
preferably about 80% or more, more preferably about 90% or 
more, still preferably about 95% or more, and most 
preferably about 98% or more with the amino acid sequence 
of SEQ ID NO: 75. 

The percent identity of the amino acids can be 
determined by visual inspection and mathematical 
calculation. The percent identity between two protein 
sequences may be determined by comparing sequence 
information based on the algorithm of Needleman, S. B. and 
Wunsch, CD. ( J.Mol.Biol. , 48: 443-453, 1970) and using the 
GAP computer program available from University of Wisconsin 



Genetics Computer Group (UWGCG) . The preferred default 
parameters for the "GAP" program include: (1) a scoring 
matrix as described in Henikoff, S and Henikoff, J. G. 
( Proc. Natl. Acad. Sci. USA, 89: 10915-10919, 1992), blosum 
62; (2) a penalty of 12 for each gap; (3) a penalty of 4 
for each length of each gap; and (4) no penalty for end 
gaps . 

Other programs used by those skilled in the art for 
sequence comparison can also be used. For example, the 
percent identity may be determined by comparing sequence 
information using the BLAST program described in Altschul 
et al. (Nucl. Acids. Res. 25., p. 3389-3402, 1997). The 
program is available from the web site of National Center 
for Biotechnology Information (NCBI), or the web site of 
DNA Data bank of Japan (DDBJ) on the Internet. Various 
factors (parameters) for the homology research via the 
BLAST program are described in detail on the sites. A 
research is generally performed by using the default 
parameters, although some setting may be appropriately 
modified. 

It is well known for those skilled in the art that 
even proteins having the same function may have different 
amino acid sequences depending on the varieties from which 
they are derived. The Rf-1 gene of the present invention 
includes such homologs and variants of the base sequence of 
SEQ ID NO: 27 so far as they function to restore fertility. 
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The expression "function to restore fertility" means that 
fertility is conferred on a rice individual or seed when 
such a DNA fragment is introduced. Fertility restoration 
may result from the expression of a protein by the Rf-1 
gene or some function of the nucleic acid (DNA or RNA) per 
se of the Rf-1 gene in conferring fertility. 

Whether or not a homolog or variant of the Rf-1 gene 
functions to restore fertility can be examined by, but not 
limited to, the following method, for example. A nucleic 
acid fragment under test is introduced into immature seeds 
obtained by pollinating MS Koshihikari (sterile line) with 
MS-FR Koshihikari according to the method of Hiei et al. 
(Plant Journal (1994), 6(2), p. 272-282). As the resulting 
transf ormants are cultured under normal conditions, the 
seeds mature only when the nucleic acid fragment under test 
functions to restore fertility. 

The nucleic acid derived from a corresponding region 
of japonica Asominori not carrying the Rf-1 gene has the 
base sequence shown in SEQ ID NO: 28. Corresponding parts 
of SEQ ID NO: 28 and SEQ ID NO: 27 have an overall identity 
of about 98%. Thus, nucleic acids containing the locus of 
a fertility restorer gene (Rf-1) of the present invention 
are at least about 70%, preferably about 80% or more, more 
preferably 90% or more, still more preferably 95% or more, 
most preferably 98 or more % identical to SEQ ID NO: 27. 
Especially, the term "SEQ ID NO: 27" intends to mean any one 
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of g) the bases 43907-46279 of SEQ ID NO:27, or 
alternatively, a) the bases 215-2587 of SEQ ID NO: 69, b) 
the bases 213-2585 of SEQ ID NO:70, c) the bases 218-2590 
of SEQ ID NO:71, d) the bases 208-2580 of SEQ ID NO:72, e) 
the bases 149-2521 of SEQ ID NO:73, f) the bases 225-2597 
of SEQ ID NO:74, h) the bases 229-2601 of SEQ ID NO:80, i) 
the bases 175-2547 of SEQ ID NO: 81, j) the bases 227-2599 
of SEQ ID NO:82, k) the bases 220-2592 of SEQ ID NO:83, 1) 
the bases 174-2546 of SEQ ID NO: 84 or m) the bases 90-2462 
of SEQ ID NO: 85 corresponding thereto. 

The percent identity of a nucleic acid may be 
determined by visual inspection and mathematical 
calculation. Alternatively, the percent identity of two 
nucleic acid sequences can be determined by comparing 
sequence information using the GAP computer program, 
version 6.0 described by Devereux et al., Nucl. Acids 
Res., 12:387 (1984) and available from the University of 
Wisconsin Genetics Computer Group (UWGCG) . The preferred 
default parameters for the GAP program include: (1) a unary 
comparison matrix (containing a value of 1 for identities 
and 0 for non-identities) for bases, and the weighted 
comparison matrix of Gribskov and Burgess, Nucl. Acids 
Res., 14:6745 (1986), as described by Schwartz and Dayhoff, 
eds., Atlas of Protein Sequence and Structure, National 
Biomedical Research Foundation, pp. 353-358 (1979); (2) a 
penalty of 3.0 for each gap and an additional 0.10 penalty 
for each symbol in each gap; and (3) no penalty for end 
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gaps. Other programs used by those skilled in the art of 
sequence comparison may also be used. 

Nucleic acids of the present invention also include 
5 nucleic acids which are capable of hybridizing to the base 
sequence of SEQ ID NO: 27 under conditions of moderately 
stringent conditions and functions to restore fertility, 
and nucleic acids which are capable of hybridizing to the 
base sequence of SEQ ID NO: 27 under conditions of highly 
10 stringent conditions and functions to restore fertility. 

As used herein, conditions of moderate stringency can 
be readily determined by those having ordinary skill in the 
art based on, for example, the length of the DNA. The 

15 basic conditions are set forth by Sambrook et al. 

Molecular Cloning: A Laboratory Manual, 2nd. Vol. 1, pp. 
1.101-104, Cold Spring Harbor Laboratory Press, (1989), and 
include use of a prewashing solution for the nitrocellulose 
filters 5 x SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0), 

20 hybridization conditions of about 1 x SSC to 6 x SSC at 

about 40°C to 60°C (or other similar hybridization solution, 
such as Stark's solution, in about 50% formamide at about 
42°C), and washing conditions of about 60°C, 0.5 x SSC, 0.1% 
SDS. The hybridization temperature is about 15-20°C lower 

25 when the hybridization solution contains about 50% 

formamide. Conditions of high stringency can also be 
readily determined by the skilled artisan based on, for 
example, the length of the DNA. Generally, conditions of 
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high stringency include hybridization and/or washing 
conditions at higher temperatures and/or lower salt 
concentrations than in the conditions of moderate 
stringency described above. For example, such conditions 
include hybridization conditions of 0.1 x SSC to 0.2 x SSC 
at about 60-65°C and/or washing conditions of 0 . 2 x SSC, 
0.1% SDS at about 65-68°C. The skilled artisan will 
recognize that the temperature and wash solution salt 
concentration can be adjusted as necessary according to 
factors such as the length of the probe. 

Especially preferably, "SEQ ID NO: 2 7" intends to mean 
any one of g) the bases 43907-46279 of SEQ ID NO: 27, or 
alternatively, a) the bases 215-2587 of SEQ ID NO: 69, b) 
the bases 213-2585 of SEQ ID NO: 70, c) the bases 218-2590 
of SEQ ID NO: 71, d) the bases 208-2580 of SEQ ID NO: 72, e) 
the bases 149-2521 of SEQ ID NO: 73, f) the bases 225-2597 
of SEQ ID NO:74, h) the bases 229-2601 of SEQ ID NO:80, i) 
the bases 175-2547 of SEQ ID NO: 81, j) the bases 227-2599 
of SEQ ID NO:82, k) the bases 220-2592 of SEQ ID NO:83, 1) 
the bases 174-2546 of SEQ ID NO: 84 or m) the bases 90-2462 
of SEQ ID NO: 85 corresponding thereto. 

DNAs of the present invention also include nucleic 
acids that differ from the base sequence of SEQ ID NO: 27 
due to deletions, insertions or substitutions of one or 
more bases while retaining a fertility restoring function. 
So far as a fertility restoring function is retained, the 
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number of bases to be deleted, inserted or substituted is 
not specifically limited, but preferably 1 to several 
thousands, more preferably 1-1000, still more preferably 
1-500, even more preferably 1-200, most preferably 1-100. 

The Rf-1 gene has further been specified on the basis 
of the descriptions herein, and it can be used by those 
skilled in the art after nucleic acids such as other 
regions than the Rf-1 gene or intron regions in the Rf-1 
gene are removed. A given amino acid (especially, the 
amino acid sequence of SEQ ID NO: 75) may be replaced, for 
example, by a residue having similar physicochemical 
characteristics. Examples of such conservative 
substitutions include changes from one aliphatic residue to 
another, such as changes from one to another of lie, Val, 
Leu, or Ala; changes from one polar residue to another, 
such as changes between Lys and Arg, Glu and Asp, or Gin 
and Asn; or changes from one aromatic residue to another, 
such as changes from one to another of Phe, Trp, or Tyr. 
Other well-known conservative substitutions include e.g. 
changes between entire regions having similar hydrophobic 
characteristics. Those skilled in the art can introduce 
desired deletions, insertions or substitutions by well- 
known gene engineering techniques using e.g. site-specific 
mutagenesis as described in Sambrook et al.. Molecular 
Cloning: A Laboratory Manual, 2nd edition. Cold Spring 
Harbor Laboratory Press, (1989). 
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We compared an indica variety IR24 carrying the Rf-1 
gene (SEQ ID NO: 27) with japonica varieties not carrying it 
such as Asominori (SEQ ID NO: 28) and a Nipponbare BAC clone 
deposited with GenBank (Accession No. AC068923). As a 
5 result, we found that the Rf-1 region of the indica variety 
containing the Rf-1 gene has at least the following single 
bases polymorphisms (SNP). 

1) a base corresponding to the base 1239 of SEQ ID 
NO:27 is A; 

10 2) a base corresponding to the base 6227 of SEQ ID 

NO: 27 is A; 

3) a base corresponding to the base 20680 of SEQ ID 
NO: 27 is G; 

4) a base corresponding to the base 45461 of SEQ ID 
15 NO:27 is A; 

5) a base corresponding to the base 49609 of SEQ ID 

NO: 27 is A; 

6) a base corresponding to the base 56368 of SEQ ID 
N0:27 is T; 

20 7) a base corresponding to the base 57629 of SEQ ID 

NO: 27 is C; and 

8) a base corresponding to the base 66267 of SEQ ID 
NO: 27 is G. 

25 Thus, nucleic acids containing the Rf-1 region of the 

present invention preferably meet one to all of the 
requirements l)-8) above. 
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In Example 3 below, the chromosomal organizations of 
recombinants proximal to the Rf-1 gene (RS1-RS2, RC1-RC8) 
were tested in the Rf-1 region. The results showed that a 
sequence determining the presence of the function of the 
5 Rf-1 gene is contained in the base sequence of bases 1239- 
66267 of SEQ ID NO: 27, i.e. in a region from the P4497 Mbol 
to B56691 Xbal loci (about 65 kb) as estimated at maximum 
(Fig. 3). However, there is a possibility that it is 
important for the expression of the genetic function of the 

10 Rf-1 gene that the Rf-1 gene is partially of the indica 
genotype, and that the genetic function may not be 
significantly changed whether the remaining regions are of 
the japonica or indica genotype. There may be an extreme 
case that the coding region is completely identical and 

15 only the promoter region is different between japonica and 
indica, and that the promoter region and the coding region 
are only partially included in the region from P4497 the 
Mbol to B56691 Xbal loci (about 65 kb) . Therefore, it 
cannot be concluded that the common indica region above 

20 (bases 1239-66267 of SEQ ID NO: 27) completely contains the 
entire Rf-1 gene. However, it is thought that at least SEQ 
ID NO: 27 completely contains the entire Rf-1 gene for the 
following reasons : 

1) the size of a gene is normally several kilobases, 
25 and rarely exceeds 10 kb; 

2) the genomic base sequence of IR24 determined by the 
present invention (SEQ ID NO: 27) completely contains the 
common indica region above; 
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3) the 5' end of SEQ ID NO: 27 is located 1238 bp 
upstream of the 5' end of the common indica region above 
and forms a part of another gene (S12564); and 

4) the 3' end of SEQ ID NO: 27 is located 10096 bp 
downstream of the 3' end of the common indica region above. 

In this way, we first succeeded in restricting the 
region of the Rf-1 gene to 76 kb. Thus, nucleic acids 
containing the region of the Rf-1 gene of the present 
invention are extremely less likely to contain other genes 
proximal to the Rf-1 gene as compared with those selected 
with the co-dominant marker locus at a genetic distance of 
about 1 cM (about 300 kb) from the Rf-1 gene described in a 
prior documents such as Japanese Patent Public Disclosure 
No. 2000-139465. Moreover, they are less likely to contain 
other genes than those selected with the DNA marker loci 
S12564 Tsp509I and C1361 Mwol (at a distance of about 0.3 
cM between them) described in our prior Japanese Patent 
Application No. 2000-247204. 

We further confirmed by complementation assays that 
the Rf-1 gene is completely contained in especially bases 
38538-54123 of the base sequence of SEQ ID NO: 27. In an 
embodiment of the present invention, therefore, the base 
sequence at least 70% identical to the base sequence of SEQ 
ID NO: 27 or to the base sequence of bases 38538-54123 of 
SEQ ID NO: 27 meets at least one of the following 
requirements 1) and 2): 



1) a base corresponding to the base 45461 of SEQ ID 
NO: 27 is A; 

2) a base corresponding to the base 49609 of SEQ ID 
NO: 27 is A. 



10 



15 



20 



25 



The present inventors further determined that the 
following regions as being nucleic acids containing the Rf- 
1 gene. 



a ) 


the 


bases 


215 


-2587 


of 


SEQ 


ID 


NO: 


69 


b) 


the 


bases 


213 


-2585 


of 


SEQ 


ID 


NO: 


70 


c) 


the 


bases 


218 


-2590 


of 


SEQ 


ID 


NO: 


71 


d) 


the 


bases 


208 


-2580 


of 


SEQ 


ID 


NO: 


72 


e) 


the 


bases 


149 


-2521 


Of 


SEQ 


ID 


NO: 


73 


f ) 


the 


bases 


225 


-2597 


of 


SEQ 


ID 


NO: 


74 


h) 


the 


bases 


229 


-2601 


of 


SEQ 


ID 


NO: 


80 


i) 


the 


bases 


175 


-2547 


of 


SEQ 


ID 


NO: 


81 


3) 


the 


bases 


227 


-2599 


of 


SEQ 


ID 


NO: 


82 


k) 


the 


bases 


220 


-2592 


of 


SEQ 


ID 


NO: 


83 


1) 


the 


bases 


174 


-2546 


of 


SEQ 


ID 


NO: 


84 


m) 


the 


bases 


90- 


2462 


of 


SEQ ID NO: 85. 


The 


above base sequences 


correspond 


to 



and 



43907-46279 of SEQ ID NO: 27. The nucleic acids of the 
present invention further include 

n) a nucleic acid which is identical to at least 70% 
of the nucleic acid of any of a) - m) , and which functions 
to restore fertility; 

o) a nucleic acid which hybridizes to the nucleic 
acid of any of a) - m) under a moderate or high stringent 
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condition, and which functions to restore fertility; and 

p) a nucleic acid wherein one or a plurality of 
base(s) is deleted from, added to or substituted from the 
nucleic acid of any of a) - m>, and which functions to 
5 restore fertility. 

The base 45461 of SEQ ID NO: 27 corresponds to 1) the 
base 1769 of SEQ ID NO. 69; 2) the base 1767 of SEQ ID 
NO. 70; 3) the base 1772 of SEQ ID NO. 71; 4) the base 1762 
10 of SEQ ID NO. 72; 5) the base 1703 of SEQ ID NO. 73; 6) the 
base 1779 of SEQ ID NO. 74; 7) the base 1783 of SEQ ID 
NO. 80; 8) the base 1729 of SEQ ID NO. 81; 9) the base 1781 
of SEQ ID NO. 82; 10) the base 1774 of SEQ ID NO. 83; 11) the 
base 1728 of SEQ ID NO. 84; and 12) the base 1644 of SEQ ID 
15 NO. 85. Accordingly, especially preferably, the nucleic 

acid used for the method of the present invention meets at 
least one of the following requirements 1) - 12): 

1) a base corresponding to the base 1769 of SEQ ID 
NO. 69 is A; 

20 2 ) a base corresponding to the base 1767 of SEQ ID 

NO. 70 is A; 

3) a base corresponding to the base 1772 of SEQ ID 
NO. 71 is A; 

4) a base corresponding to the base 1762 of SEQ ID 

25 NO. 72 is A; 

5) a base corresponding to the base 1703 of SEQ ID 

NO. 73 is A; 

6) a base corresponding to the base 1779 of SEQ ID 
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NO. 74 is A; 

7) a base corresponding to the base 1783 of SEQ ID 
NO. 80 is A; 

8) a base corresponding to the base 1729 of SEQ ID 
NO. 81 is A; 

9) a base corresponding to the base 1781 of SEQ ID 
NO. 82 is A; 

10) a base corresponding to the base 1774 of SEQ ID 
NO. 83 is A; 

11) a base corresponding to the base 1728 of SEQ ID 

NO. 84 is A; or 

12) a base corresponding to the base 1644 of SEQ ID 

NO. 85 is A. 

TV. Mftthnrl for r ftatnrlnn rice fertility 

The present invention provides a method for restoring 
rice fertility comprising introducing a nucleic acid into 
rice, wherein the nucleic acid has the base sequence of SEQ 
ID NO. 27, or has a base sequence which is identical to at 
least 70% of the base sequence of SEQ ID NO. 27, and which 
functions to restore fertility. The methods of the present 
invention may comprise introducing a nucleic acid into 
rice, wherein the nucleic acid has a portion of SEQ ID 
NO: 27, especially the bases 38538-54123, preferably the 
bases 42357-53743, more preferably the bases 42132-48883 of 
SEQ ID NO: 27 or has a base sequence which is at least 70% 
identical to the base sequence of bases 38538-54123, 
preferably the bases 42357-53743, more preferably the bases 
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42132-48883 of SEQ ID NO: 27, still more preferably the 
bases 42132-46318 and, which functions to restore 
fertility. 

In a particularly preferable embodiment of the present 
method, the nucleic acid encodes the amino acid sequence of 
SEQ ID NO. 75, or an amino acid sequence which is identical 
to at least 70% of the amino acid sequence of SEQ ID NO. 75, 
and which functions to restore fertility is introduced into 
rice. Most preferably, the nucleic acid encoding the amino 
acid sequence of SEQ ID NO. 75, or an amino acid sequence 
which is identical to at least 70% of the amino acid 
sequence of SEQ ID NO. 7 5 is selected from nucleic acids of 

the following a) - p) : 

a) a nucleic acid comprising the bases 215-2587 of 

SEQ ID NO: 69; 

b) a nucleic acid comprising the bases 213-2585 of 

SEQ ID NO: 70; 

c) a nucleic acid comprising the bases 218-2590 of 

SEQ ID NO: 71; 

d) a nucleic acid comprising the bases 208-2580 of 

SEQ ID NO: 72; 

e) a nucleic acid comprising the bases 149-2521 of 

SEQ ID NO: 73; 

f ) a nucleic acid comprising the bases 225-2597 of 

SEQ ID NO: 74; 

g) a nucleic acid comprising the bases 43907-46279 of 
SEQ ID NO: 27; 



h) a nucleic acid comprising the bases 229-2601 of 
SEQ ID NO: 80; 

i) a nucleic acid comprising the bases 175-2547 of 
SEQ ID NO: 81; 

j) a nucleic acid comprising the bases 227-2599 of 
SEQ ID NO: 82; 

k) a nucleic acid comprising the bases 220-2592 of 
SEQ ID NO: 83; 

1) a nucleic acid comprising the bases 174-2546 of 
SEQ ID NO: 84; 

m) a nucleic acid comprising the bases 90-2462 of SEQ 
ID NO: 85; 

n) a nucleic acid which is identical to at least 70% 
of the nucleic acid of any of a) - m) , and which functions 
to restore fertility; 

o) a nucleic acid which hybridizes to the nucleic 
acid of any of a) - m) under a moderate or high stringent 
condition, and which functions to restore fertility; and 

p) a nucleic acid wherein one or a plurality of 
base(s) is deleted from, added to or substituted from the 
nucleic acid of any of a) - m) , and which functions to 
restore fertility. 

In the present invention, the nucleic acid containing 
the locus of a fertility restorer gene (Rf-1) that can be 
introduced into rice can be any one of the nucleic acids 
described above in "III. Nucleic acids containing the Rf-1 
locus". The method for introducing the nucleic acid into 
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rice is not specifically limited but can be any known 
method. Nucleic acids of the present invention can be 
introduced by known genetic engineering techniques or 
crossing. Genetic engineering techniques are preferably 
used because inclusion of other neighboring genes can be 
prevented and the period for establishing a line can be 
shortened. 

Any suitable expression system for transduction by 
genetic engineering techniques can be employed. 
Recombinant expression vectors comprise a nucleic acid 
containing a fertility restorer gene (Rf-1) of the 
invention that can be introduced into rice, operably linked 
to suitable transcriptional or translational regulatory 
base sequences, such as those derived from a mammalian, 
microbial, viral, or insect gene. 

Examples of regulatory sequences include 
transcriptional promoters, operators, or enhancers, an mRNA 
ribosomal binding site, and appropriate sequences which 
control transcription and translation initiation and 
termination. Base sequences are operably linked to a 
regulatory sequence when the regulatory sequence is 
functionally associated with the DNA sequences. Thus, a 
promoter base sequence is operably linked to a DNA sequence 
if the promoter base sequence controls the transcription of 
the DNA sequence. An origin of replication that confers 
the ability to replicate in rice, and a selection gene by 



which transf ormants are identified, are generally 
incorporated into expression vectors. As for selectable 
markers, those commonly used can be used by standard 
methods. Examples are genes resistant to antibiotics such 
as tetracycline, ampicillin, kanamycin, neomycin, 
hygromycin or spectinomycin . 

In addition, a sequence encoding an appropriate signal 
peptide (native or heterologous) can be incorporated into 
expression vectors. A DNA sequence for a signal peptide 
(secretory leader) may be fused in frame to a nucleic acid 
sequence of the invention so that the DNA is initially 
transcribed, and the mRNA translated into a fusion protein 
containing the signal peptide. 

The present invention also provides recombinant 
vectors containing a gene of the present invention. 
Methods for integrating a DNA fragment of a gene of the 
present invention into a vector such as a plasmid are 
described in e.g. Sambrook, J. et al. Molecular Cloning, A 
Laboratory Manual (2nd edition), Cold Spring Harbor 
Laboratory, 1.53 (1989). Commercially available ligation 
kits (e.g. available from TAKARA) can be conveniently used. 
Thus obtained recombinant vectors (e.g. recombinant 
plasmids) are transferred into host rice cells. 

Vectors can be conveniently prepared by linking a 
desired gene to a recombinant vector available in the art 



(e.g. plasmid DNA) by standard methods. Plant transforming 
vectors are especially useful for conferring fertility on 
rice using a nucleic acid fragment of the present 
invention. Vectors for plants are not specifically limited 
so far as they can express the gene of interest in plant 
cells to produce the protein, but preferably include 
pBI221, pBI121 (Clontech), and vectors derived therefrom. 
Especially, examples of vectors for transforming rice 
belonging to monocotyledons include pIG121Hm and pTOK233 
(Hiei et al.. Plant J., 6, 271-282 (1994)), and pSB424 
(Komari et al. , Plant J., 10, 165-174 (1996)). 

Transgenic plants can be prepared by replacing the 
(3- glucuronidase (GUS) gene in the above vectors with a 
nucleic acid fragment of the present invention to construct 
a plant transforming vector and transfecting it into a 
plant. The plant transforming vector preferably comprises 
at least a promoter, a start codon, a desired gene (a 
nucleic acid sequence of the present invention or a part 
thereof), a stop codon and a terminator. It may also 
contain a DNA encoding a signal peptide, an enhancer 
sequence, non-translated 5' and 3' regions of the desired 
gene, a selectable marker region, etc., as appropriate. 
Promoters and terminators are not specifically limited so 
far as they are functional in plant cells, among which 
constitutive expression promoters include the 35S promoter 
initially contained in the above vectors as well as 
promoters for actin and ubiquitin genes. 



Suitable methods for introducing a plasmid into a host 
cell include the use of calcium phosphate or calcium 
chloride /rubidium chloride, electroporation, 
electroinjection, chemical treatment with PEG or the like, 
the use of a gene gun described in Sambrook, J. et al. , 
Molecular Cloning, A Laboratory Manual (2nd edition). Cold 
Spring Harbor Laboratory. 1.74(1989). Plant cells can be 
transformed by e.g. the leaf disc method [Science, 227. 129 
(1985)] or electroporation [Nature, 319. 791 (1986)]. 

Methods for transferring a gene into a plant include 
the use of Agrobacterium (Horsch et al. , Science, 
227,129(1985); Hiei et al. , Plant J.. 6, 271-282(1994)). 
electroporation (Fromm et al.. Nature. 319. 791(1986)). PEG 
(Paszkowski et al.. EMBO J., 3, 2717(1984)). microinjection 
(Crossway et al. , Mol. Gen. Genet., 202, 179 (1986)). 
particle bombardment (McCabe et al. , Bio/Technology, 6, 
923(1988)). Methods are not specifically limited so far as 
they are suitable for transfecting a nucleic acid into a 
desired plant. 

Transduction by crossing can be performed as follows, 
for example. First, F x obtained by crossing an Rf-1 donor 
parent and a japonica variety is backcrossed with the 
japonica variety. The resulting individuals are screened 
for those homozygous for japonica at the S12564 Tsp509I 
locus and heterozygous at the P4497 Mbol and B53627 BstZ17I 
loci and further backcrossed. The resulting individuals 
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are screened for those heterozygous at the P4 497 Mbol and 
B56691 Xbal loci and homozygous for japonica at the B53627 
BstZ17I locus and further backcrossed. Subsequently, about 
10 cycles of screening each backcrossed generation for 
5. individuals heterozygous at the P4497 Mbol and B56691 Xbal 
loci and subjecting them to the subsequent backcrossing are 
repeated. Finally, individuals heterozygous at the P4497 
Mbol and B56691 Xbal loci are self -fertilized and the 
resulting individuals are screened for those homozygous for 
10 indica at both loci, whereby a restorer line inheriting a 
limited chromosomal region from the P4497 Mbol to B56691 
Xbal loci from the Rf-1 donor parent can be obtained. 

According to the present invention, nucleic acids 
15 containing a fertility restorer gene (Rf-1) were isolated, 
whereby the Rf-1 gene can be introduced into a rice variety 
using genetic engineering techniques to establish a 
restorer line. The present invention succeeded in 
restricting the Rf-1 region to 76 kb or less in the first 
20 place. Therefore, nucleic acids containing the Rf-1 locus 
of the present invention are extremely less likely to 
contain other genes neighboring the Rf-1 gene than those of 
the prior art. Moreover, the entire base sequence of the 
region containing the Rf-1 gene was determined by the 
25 present invention. Those skilled in the art can proceed 
with analysis of the Rf-1 gene itself on the basis of the 
description herein. Thus, only the Rf-1 gene can be 
introduced without including any neighboring gene. This is 
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especially important when neighboring genes bring 
deleterious traits. Furthermore, restorer lines can be 
established in a shorter period such as 1-2 years than 
obtained by crossing. 

5 

In complementation assays described in Examples 4-13 
and 17 herein, MS Koshihikari (having BT cytoplasm and a 
core gene substantially identical to Koshihikari) was 
actually transformed by an Agrobacterium-mediated method 

10 using fragments from 10 clones described in Fig. 5. The 
results demonstrated that fertility restorer lines can be 
established from a nucleic acid containing the base 
sequence of the bases 38538-54123, preferably the bases 
42357-53743, more preferably the bases 42132-48883, still 

15 more preferably the bases 42132-46318 of SEQ ID NO: 27. 

Agrobacterium-mediated methods for establishing rice 
restorer lines are described in, but not limited to, Hiei 
et al., Plant J., 6, pp. 271-282(1994), Komari et al. , Plant 
20 J., 10, p. 165-174(1996) , Ditta et al. , Proc. Natl. Acad. 
Sci. USA 77: pp. 7347-7351(1980), etc. 

First, a plasmid vector containing a nucleic acid 
fragment of interest to be inserted is prepared. Suitable 
25 plasmid vectors include e.g. pSBll, pSB22 and the like 

having a plasmid map described in Komari et al.. Plant J., 
10, pp. 165-174 (1996), supra. Alternatively, those 
skilled in the art can also construct an appropriate vector 



by themselves on the basis of plasmid vectors such as 
pSBll, pSB22 described above. In the examples herein 
below, an intermediate vector pSB200 having a hygromycin- 
resistant gene cassette was prepared on the basis of pSBll. 
5 and used. Specifically, a nopaline synthase terminator 
(Tnos) was first fused to a ubiquitin promoter and a 
ubiquitin intron (Pubi-ubil). A hygromycin-resistant gene 
(HYG(R) ) was inserted between ubil and Tnos of the 
resulting Pubi-ubil -Tnos complex to give a Pubi-ubil - 
10 HYG(R) -Tnos assembly. This assembly was fused to a 

Hindlll/EcoRI fragment of pSBll (Komari et al. , supra.) to 
give pKY205. Linker sequences for adding restriction 
enzyme sites Not I. NspV, EcoRV. Kpnl . Sacl. EcoRI were 
inserted into the Hind III site upstream of Pubi of this 
15 pKY205 to give pSB200 having a hygromycin-resistant gene 
cassette . 

Then, E. coli cells (e.g. DH5a, JM109, MV1184, all 
commercially available from e.g. TAKARA) are transformed 
20 with the recombinant vector containing the nucleic acid 
inserted. 

Thus transformed E. coli cells are used for 
triparental mating with an Agrobacterium strain preferably 
25 in combination with a helper E. coli strain according to 
e.g. the method of Ditta et al. (1980). Suitable 
Agrobacterium strains include Agrobacterium tumefaciens 
strains such as LBA4404/pSBl , LBA4404/pNBl , LBA4404/pSB3 , 
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etc. They all have a plasmid map described in Komari et 
al.. Plant J., 10, pp. 165-174 (1996), supra, and can be 
used by those skilled in the art by constructing a vector 
by themselves. Suitable helper E. coli strains include, 
5 but not limited to, e.g. HB101/pRK2013 (available from 
Clontech) . A report shows that E. coli cells carrying 
pRK2073 can also be used as helper E. coli though they are 
less common (Lemas et al. , Plasmid 1992, 27, pp. 161-163). 

Then, the Agrobacterium cells mated as intended are 
transformed into male sterility rice according to e.g. the 
method of Hiei et al (1994). Necessary immature rice seeds 
for transformation can be prepared by e.g. pollinating male 
sterility rice with a japonica variety. 

Fertility restoration in transformed plants can be 
assessed by e.g. evaluating seed fertility in standing 
plants about one month after heading. Evaluation on 
standing plants means observation of plants grown in a 
field or the like. An alternative method is a laboratory 
study of grain ripening percentages in the ear. 

V. Methods for discerning the presence of the Rf-1 gene 

According to the present invention, it was shown that 
25 a sequence determining the presence of the function of the 
Rf-1 gene is located between the polymorphism-detecting 
marker loci P4497 Mbol and B56691 Xbal on rice chromosome 
10. Moreover, complementation assays confirmed that the 
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Rf-1 gene is completely contained in especially bases 
38538-54123 of the base sequence of SEQ ID NO: 27. 

Comparison of the base sequence of an indica variety 
5 carrying the Rf-1 gene (IR2 4) (SEQ ID NO: 27) with those of 
japonica varieties not carrying said gene (Asominori (SEQ 
ID NO: 28) and Nipponbare BAC clone AC068923) revealed the 
presence of polymorphisms between both varieties. As a 
result , it became possible to conveniently, rapidly and 
10 exactly discern whether or not a rice plant or seed under 

test carries the Rf-1 gene on the basis of polymorphisms in 
base sequence in regions neighboring the Rf-1 gene. 

Therefore, the present invention also provides a 
15 method for discerning whether or not a subject rice 

individual or a seed thereof has the Rf-1 gene or not, 
wherein the method utilizing a fact that a sequence 
determining the presence of the function of the Rf-1 gene 
positions between the polymorphism detection marker loci 
20 P4497 Mbol and B56691 Xba I on rice chromosome 10. 

Polymorphisms can be detected by any known method. 
For example, known methods include assays for restriction 
fragment length polymorphisms (RFLPs); direct determination 
25 by sequencing; cutting a genomic DNA with a 8 -base 

recognizing restriction enzyme, and then radioactively 
labeling the ends and further cutting the labeled digest 
with 6 -base and 4 -bases recognizing restriction enzyme and 
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then developing the digest by two-dimensional 
electrophoresis (RLGS, Restriction Landmark Genome 
Scanning); etc. AFLP analysis (amplified fragment length 
polymorphism; P. Vos et al. , Nucleic Acids Res. Vol. 23 , 
5 pp. 4407-4414 (1995)) has also been developed wherein RFLP 
is amplified/detected by polymerase chain reaction (PGR). 

For example, conventional methods involved detecting 
RFLPs via PCR amplif ication (conversion of RFLP markers 
10 into PCR markers) or detecting polymorphisms in 

microsatellites via PCR amplification (microsatellite 
markers) as illustrated below. 

Conversion of RFLP markers into PCR markers 
15 A. PCR markers based on polymorphisms in genomic 

regions corresponding to RFLP probes (D.E. Harry, 
B.Temesgen, D.B. Neale; Codominant PCR-based markers for 
Pinus taeda developed from mapped cDNA clones, Theor. Appl. 
Genet. (1998) 97: pp. 327-336). After performing genomic 
20 PCR using primers designed for an RFLP marker probe 

sequence ("RFLP" is a polymorphism observed by Southern 
analysis using a DNA fragment as a probe. The base 
sequence of the DNA fragment used as a probe is called 
"RFLP marker probe sequence"), a PCR marker can be prepared 
25 by either of the following two procedures. A first 

procedure involves treating the products with a series of 
restriction enzymes to search for a restriction enzyme 
causing a fragment length polymorphism, and a second 
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procedure involves searching for a polymorphism by varietal 
comparison of the base sequences of the products and 
preparing a PGR marker based on the polymorphism. 



5 B. PCR markers based on identification of RFLP-causing 

sites. A PCR marker can be obtained by identifying an 
RFLP-causing site (a restriction enzyme recognition site 
carried by only one of two varieties compared) present in 
or near (normally within several kbs) an RFLP marker probe 
10 sequence. 

M i c ro satellite markers 

Microsatellites are repeat sequences of about 2 to 4 
bases such as (CA) n that are present in great numbers in 

15 genomes. If a varietal polymorphism occurs in repetition 
number, a polymorphism can be observed in PCR product 
length by PCR using primers designed in adjacent regions, 
whereby the DNA polymorphism can be detected. Markers for 
detecting polymorphisms using microsatellites are called 

20 microsatellite markers (O. Parnaud, X. Chen, S.R. McCouch, 
Mol. Gen. Genet. (1996) 252: pp. 597-607). 

Methods for detecting polymorphisms in the present 
invention are not specifically limited. From the viewpoint 
25 of efficiency and convenience, PCR-RFLP is preferred, which 
is a combination of PCR and RFLP to identify polymorphisms 
from their restriction enzyme cleavage patterns in cases 
where they exist among variety lines at restriction enzyme 
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recognition sites in the sequences of DNA fragments 
amplified by PCR. PCR-RFLP is also called CAPS (cleaved 
amplified polymorphic sequence). If any suitable 
restriction enzyme recognition site is not present in a 
5 region showing polymorphisms, a modified CAPS called dCAPS 
(derived cleaveid amplified polymorphic sequence) can also 
be used wherein restriction enzyme sites are introduced 
during PCR (Michaels, S.D. and Amasino, R.M. (1998), The 
Plant Journal 14(3) 381-385; A. Konieczny et al.,(1993), 
10 Plant J. 4(2) pp. 403-410; Neff, M.M. , Neff, J.D., Chory, J. 
and Pepper, A.E. (1998), The Plant Journal 14(3) 387-392). 
These methods are explained in more detail below. 

CAP S , d CAP S 

15 The method for discerning of the present invention 

comprise, but not limited to: 

i) preparing a pair of primers based on the base 
sequences of a site showing a polymorphism in the base 
sequences between indica and japonica varieties at the Rf-1 

20 locus and its adjacent regions to amplify said base 
sequences; 

ii) performing nucleic acid amplification reaction(s) 
using the genomic DNA of the subject rice individual or the 
seed thereof as a template; and 

25 iii) discerning whether or not the subject rice 

individual or the seed thereof has the Rf-1 gene based on 
the polymorphism found in the nucleic acid amplification 
product . 
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The step of preparing a primer pair in i) preferably 
comprises any of the following means: 

a) when a change containing a deleted region exists in 
the polymorphism in the nucleic acid amplification product, 
preparing a pair of primers for nucleic acid amplification 
to flank the deleted region to form a marker for detecting 
the polymorphism; 

b) when a base change causing a difference in 
restriction enzyme recognition exists in the polymorphism 
in the nucleic acid amplification product, preparing a pair 
of primers for nucleic acid amplification to flank the base 
change site to form a marker for detecting the 
polymorphism; or 

c) when a base change causing no difference in 
restriction enzyme recognition exists in the polymorphism 
in the nucleic acid amplification product, preparing a pair 
of primers for introducing a mismatch, wherein pair of 
primers contain the base change site and alters a region 
containing the base change site into a base sequence 
causing a difference in restriction enzyme recognition in 
the nucleic acid amplification product to form a marker for 
detecting the polymorphism. 

Suitable polymorphic sites for discerning the presence 
of the Rf-1 gene in the present invention can be 
appropriately selected so that a polymorphism detecting 
marker can be prepared as described below on the basis of 
comparison of, but not limited to, the base sequence of an 
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indica variety carrying the Rf-1 gene (IR24) (SEQ ID NO: 27) 
with those of japonica varieties not carrying said gene 
(Asominori (SEQ ID NO: 28) and Nipponbare BAC clone 
AC068923). 

5 

If the polymorphism found causes a difference in 
restriction enzyme recognition, for example, a pair of 
primers for nucleic acid amplification are prepared to 
flank the polymorphic site and used for detecting the 

10 polymorphism. Primers are preferably designed not to be 
specific for highly repeated sequences to avoid undesired 
products. If the polymorphism found does not cause a 
difference in restriction enzyme recognition, markers can 
be prepared by applying the dCAPS method described above. 

15 Primers for dCAPS markers are preferably designed not to be 
specific for repeat sequences and to provide a product 
length of preferably 50-300 bases, more preferably about 
100 bases to ease identification of polymorphisms. 



20 If the polymorphism found involves a microsatellite , 

nucleic acid amplification primers are prepared to flank 
the microsatellite and used to detect the polymorphism. 
Again, the primers are preferably designed not to be 
specific for repeat sequences. 

25 

11 Nucleic acid amp lification 

In the present invention, a pair of primers are 
preferably prepared for amplifying adjacent regions 
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containing polymorphisms on the basis of the determined 
base sequence of the nucleic acid of a subject rice 
individual or seed at the Rf-1 locus. The primer pair is 
used to perform a nucleic acid amplification reaction with 
5 the genomic DNA of the subject rice individual or seed as a 
template. The nucleic acid amplification reaction is 
preferably polymerase chain reaction (PCR) (Saiki et al. , 
1985, Science 230, pp. 1350-1354). 

10 The pair of primers for nucleic acid amplification can 

be prepared by any known method on the basis of the base 
sequence of a polymorphic site and adjacent regions 
thereto. Specifically, a primer pair can be prepared on 
the basis of the base sequence of a polymorphic site and 

15 adjacent regions thereto by a process comprising generating 
a single- stranded DNA having the same base sequence as the 
base sequence of the polymorphic site and adjacent regions 
thereto or a base sequence complementary to said regions 
or, if necessary, generating the single- stranded DNA 

20 containing a modification without affecting the binding 
specificity to the base sequence of the polymorphic site 
and adjacent regions thereto provided that the following 
conditions are satisfied: 

1) the length of each primer should be 15-30 bases; 

25 2) the proportion of G+C in the base sequence of each 

primer should be 30-70%; 

3) the distribution of A, T, G and C in the base 
sequence of each primer should not be partially largely 
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uneven ; 

4) the length of the nucleic acid amplification 
product amplified by the primer pair should be 50-3000 
bases, preferably 50-300 bases; and 
5 5) any complementary sequence segment should not occur 

with the base sequence of each primer or between the base 
sequences of the primers . 

As used herein, the "adjacent regions" to a 
10 polymorphic site mean that an area containing both of a 

polymorphic site and adjacent regions thereto is within a 
distance suitable for nucleic acid amplification, 
preferably PCR. The adjacent regions amplified preferably 
have a length within the range of, but not limited to, 
15 about 50 bases to about 3000 bases, more preferably about 

50 bases to about 2000 bases. To facilitate identification 
of polymorphisms, the product length is preferably 50-300 
bases, more preferably about 100 bases. The adjacent 
regions preferably have a length within the range of, but 
20 not limited to, about 0 to about 3000 bases, more 
preferably about 0 to about 2000 bases, still more 
preferably about 0 to about 1000 bases on the 5' or 3' side 
of a polymorphic site. 

25 Procedures and conditions for the nucleic acid 

amplification reaction are not specifically limited and are 
well known to those skilled in the art. Appropriate 
conditions can be applied by those skilled in the art 
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depending on various factors such as the base sequence of 
the polymorphic site and adjacent regions thereto, the base 
sequence and length of the primer pair, etc. Generally, 
the nucleic acid amplification reaction can be performed 
5 under more stringent conditions (annealing reaction and 

nucleic acid elongation reaction at higher temperatures and 
less cycles) as the primer pair is longer or the proportion 
of G+C is higher or the distribution of A, T, G and C is 
evener. The use of more stringent conditions allows an 
10 amplification reaction with higher specificity. 

The amplification reaction can be performed under 
conditions of, but not limited to, one cycle of 94°C for 2 
min, 30 cycles of 94°C for 1 min, 58°C for 1 min and 72°C 

15 for 2 min, and finally one cycle of 72°C for 2 min using 50 
ng of a genomic DNA as a template, 200 \xM of each dNTP and 
5 U of ExTaq™ (TAKARA) . The reaction can also be performed 
under conditions of one cycle of 94°C for 2 min, 30 cycles 
of 94°C for 1 min, 58°C for 1 min and 7 2°C for 1 min, and 

20 finally one cycle of 72°C for 2 min. In another 

embodiment, the reaction can also be performed under 
conditions of one cycle of 94°C for 2 min, 35 cycles of 94°C 
for 30 sec, 58°C for 30 sec and 72°C for 30 sec, and finally 
one cycle of 72°C for 2 min. 

25 

The subject rice (test rice) genomic DNA used as a 
template for PCR can be easily extracted from individuals 
or seeds by the method of Edwards et al. (Nucleic Acids 
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Res. 8 (6): 1349, 1991). More preferably, DNA purified by 
standard techniques is used. An especially preferred 
extraction method is the CTAB method (Murray, M.G. et al. , 
Nucleic Acids Res. 8 ( 19 ) : 4321-5 , 1980). The DNA is 
5 preferably used as a template for PCR at a final 
concentration of 0.5 ng/nl*. 

2) Preparation of markers for detecting polymorphisms 
After examining whether or not a polymorphism is 
10 detected in the amplification product by the nucleic acid 

amplification reaction with a pair of primers, a marker for 
detecting the polymorphism is prepared on the basis of the 
polymorphism found. Non-limiting examples of polymorphisms 
that can be detected in the amplification product are as 
15 follows. 

a) A change containing a deleted region exists in the 
polymorphism in the nucleic acid amplification product. 

20 In this case, a pair of primers for nucleic acid 

amplification are prepared to flank the deleted region to 
form a marker for detecting the polymorphism. If the 
deleted region has a sufficient size, the polymorphism can 
be detected from the difference in mobility by 

25 electrophoresing the amplification product on an agarose 
gel or an acrylamide gel, for example. The polymorphism 
can be detected when the difference in base pair numbers is 
about 5% or more in the case of agarose gel electrophoresis 
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or when the difference in length is about 1 base or more in 
the case of sequencing acrylamide gel electrophoresis, for 
example. Alternatively, the polymorphism can be detected 
by hybridizing the nucleic acid amplification product using 
5 an oligobase or a DNA fragment having a complementary 

sequence to the base sequence excluding the deleted region 
as an analytical probe. Alternatively, the polymorphism 
can be confirmed by determining the base sequence of the 
amplification product, if desired. Known techniques for 

10 electrophoresis of nucleic acids, hybridization, sequencing 
and the like can be used as appropriate by those skilled in 
the art. In this case, the difference in the length of the 
amplification product directly reflects the polymorphism 
and markers for detecting polymorphisms on this basis are 

15 called ALP (amplicon length polymorphism) markers. 

b) A base change causing a difference in restriction 
enzyme recognition exists in the polymorphism in the 
nucleic acid amplification product. 

20 

In this case, a pair of primers for nucleic acid 
amplification are prepared to flank the base change site to 
form a marker for detecting the polymorphism. In this 
case, a base change causing a difference in restriction 
25 enzyme recognition occurs in the polymorphism of the 

nucleic acid amplification product, i.e. the nucleic acid 
amplification product may be cleaved or not with one or 
more specific restriction enzymes. Thus, the amplification 



product can be treated with the restriction enzymes and 
electrophoresed on e.g. an agarose gel to detect the 
polymorphism from the difference in mobility. The 
polymorphism can be confirmed by determining the base 
sequence of the amplification product, if desired. 

In this case, the difference in the length of the 
restriction fragment of the amplification product by PCR or 
the like reflects the polymorphism and markers for 
detecting polymorphisms on this basis are called CAPS 
markers or PCR-RFLP markers (A. Konieczny et al. , supra.) 

This is exemplified by primer pairs P4497 Mbol , P23945 
Mbol, P41030 TaqI, P45177 BstUI , B59066 BsaJI and B56691 
Xbal in Example 1 below. Even if the polymorphism can be 
detected by the length of the nucleic acid amplification 
product as described in a) above, the polymorphism can be 
more easily detected by combination with restriction enzyme 
treatment . 

c) A base change causing no difference in restriction 
enzyme recognition exists in the polymorphism in the 
nucleic acid amplification product. 

In this case, a pair of primers for introducing a 
mismatch are prepared that contains the base change site 
and alters a region containing the base change site into a 
base sequence causing a difference in restriction enzyme 
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recognition in the nucleic acid amplification product to 
form a marker for detecting the polymorphism. 

Specifically, a pair of primers based on the base 
5 sequences of regions naturally proximal to the Rf-1 gene 
cause a polymorphism in the nucleic acid amplification 
product but no difference in restriction enzyme 
recognition, and therefore, a mismatch is introduced into 
one or both of the primers to alter a region containing the 

10 base change site (polymorphism) into a base sequence 

causing a difference in restriction enzyme recognition in 
the nucleic acid amplification product. For example, the 
method described in Mikaelian et al., Nucl. Acids. Res. 
20:376.1992 can be used as a standard technique for 

15 substituting, deleting or adding a specific base by PCR- 
mediated site-specific mutagenesis. The amplif ication 
product using the mismatch-introducing primers as a marker 
for detecting the polymorphism may be cleaved or not with 
one or more specific restriction enzymes because it has a 

20 difference in restriction enzyme recognition at the 

mismatch- introducing site. Therefore, the amplification 
product can be treated with the restriction enzymes and 
electrophoresed on e.g. an agarose gel to detect the 
polymorphism from the difference in mobility, as described 

25 in b) above. 

The introduction of a mismatch must not affect not 
only the binding of the primers to a target plant genome 
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but also the polymorphic base change. The polymorphic base 
change is used to introduce a mismatch near it so that a 
difference in restriction enzyme recognition occurs by a 
combination of both base change and mismatch. Methods for 
introducing such a mismatch are known to those skilled in 
the art and described in detail in Michaels, S.D. and 
Amasino, R.M. (1998), Neff, M.M. , Neff, J.D., Chory, J. and 
Pepper, A.E. (1998), for example. 

Markers in this case are improved CAPS markers 
described in b) above and called dCAPS (derived CAPS) 
markers. This is exemplified by P9493 BslI in Example 3 
below. 

If there are many extra restriction sites unrelated to 
varietal polymorphisms in the case of b) or c) above, it 
may be difficult to discern any difference in restriction 
site recognition based on polymorphisms. In this case, a 
mismatch may be introduced into a primer as appropriate to 
abolish unnecessary restriction sites. For example, a 
mismatch was introduced into the R-primer to abolish the 
Mspl site unrelated to polymorphisms in B6 0304 Mspl in 
Example 3 . 

Although the invention is not limited to any specific 
method, CAPS or dCAPS methods have several advantages over 
other RFLP methods. Specifically, analyses can be made 
with smaller amounts of samples than in RFLP, for example. 
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Another advantage is that the time and labor required for 
analyses can be greatly reduced. Polymorphisms detected 
with PCR markers can be visualized by agarose gel 
electrophoresis that is easier than acrylamide gel 
electrophoresis used for microsatellite markers. 

Preferred embodim ents of th e di sc erning method nf i-h^ 
present invention 

Preferred embodiments of the method for discerning 
whether or not a subject rice has the Rf-1 gene are 
described below for illustrative purposes. In the examples 
herein, it was found that the base sequence of an indica 
variety IR24 carrying the Rf-1 gene (SEQ ID NO: 27) has at 
least the following polymorphisms l)-8) as compared with 
corresponding regions of japonica varieties: 

1) a base corresponding to the base 1239 of SEQ ID 
NO: 27 is A; 

2) a base corresponding to the base 6227 of SEQ ID 
NO: 27 is A; 

3) a base corresponding to the base 20680 of SEQ ID 
NO: 27 is G; 

4) a base corresponding to the base 45461 of SEQ ID 
NO: 27 is A; 

5) a base corresponding to the base 49609 of SEQ ID 
NO:27 is A; 

6) a base corresponding to the base 56368 of SEQ ID 
NO: 27 is T; 

7) a base corresponding to the base 57629 of SEQ ID 



NO: 27 is C; and 

8) a base corresponding to the base 66267 of SEQ ID 
NO: 27 is G. 

5 In preferred embodiments of the present invention, 

therefore, the subject rice individual or seed is judged as 
carrying the Rf-1 gene when one to all of the requirements 
l)-8) above are met. 

10 We further verified that a region essential for the 

expression of the function of the Rf-1 gene is contained in 
especially the bases 38538-54123, preferably the bases 
42357-53743, more preferably the bases 42132-48883, still 
more preferably the bases 42132-46318 in the base sequence 

15 of SEQ ID NO: 27. In an embodiment of the present 

invention, therefore, the subject rice individual or seed 
is determined to have the Rf-1 gene in the case that the 
nucleic acid having a base sequence which is identical to 
at least 70% of the base sequence of SEQ ID NO. 27 or of the 

20 base sequence of bases 38538-54123 of SEQ ID NO. 27, meets 

at least one of the following requirements 1) and 2): 

1) a base corresponding to the base 45461 of SEQ ID 

NO. 27 is A; and 

2) a base corresponding to the base 49609 of SEQ ID 
25 NO. 27 is A. 

Known polymorphism detecting methods can be used to 
determine whether or not the above requirements are met. 
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The base sequence of adjacent regions containing said 
sequence can also be directly determined. However, CAPS or 
dCAPS methods described above are preferably used because 
they are rapid and convenient. CAPS or dCAPS methods can 

5 be performed by a protocol comprising, for example: 

i) preparing a pair of primers based on a base 

sequence of adjacent regions including any one of the 
following base; 

1) a base corresponding to the base 1239 of SEQ ID 
10 NO: 27; 

2) a base corresponding to the base 6227 of SEQ ID 
NO: 27; 

3) a base corresponding to the base 20680 of SEQ ID 
NO: 27; 

15 4) a base corresponding to the base 45461 of SEQ ID 

NO: 27; 

5) a base corresponding to the base 49609 of SEQ ID 
NO: 27; 

6) a base corresponding to the base 56368 of SEQ ID 
20 NO:27; 

7) a base corresponding to the base 57629 of SEQ ID 
NO: 27; and 

8) a base corresponding to the base 66267 of SEQ ID 
NO: 27 is G. 

25 to amplify both the base of the above and adjacent regions 
thereto; 

ii) performing nucleic acid amplification reaction (s) 
using the genome DNA of the subject rice individual or the 



seed thereof as a template; and 

iii) discerning the presence of the Rf-1 in the 
subject rice individual or the seed thereof based on 
polymorphism found in said nucleic acid amplification 
5 product . 

The detection of polymorphisms in the nucleic acid 
amplification product is performed by, but not limited to, 
discerning the subject rice individual or seed to have the 
10 Rf-1 gene when one to all of the requirements l)-8) below 
are met : 

1) a region including a base corresponding to the base 
1239 of SEQ ID NO: 27 does not have any Mbol recognition 
sequence; 

15 2) a region including a base corresponding to the base 

6227 of SEQ ID NO: 27 does not have any BslI recognition 
sequence; 

3) a region including a base corresponding to the base 
20680 of SEQ ID NO: 27 does not have any Mbol recognition 

20 sequence; 

4) a region including a base corresponding to the base 
45461 of SEQ ID NO: 27 does not have any TaqI recognition 
sequence; 

5) a region including a base corresponding to the base 
25 49609 of SEQ ID NO: 27 does not have any BstUI recognition 

sequence; 

6) a region including a base corresponding to the base 
56368 of SEQ ID NO: 27 does not have any Mspl recognition 



sequence; 

7) a region including a base corresponding to the base 
57629 of SEQ ID NO: 27 does not have any BsaJI recognition 
sequence; and 

8) a region including a base corresponding to the base 
66267 of SEQ ID NO: 27 does not have any Xbal recognition 
sequence . 

However, the present invention is not limited to the 
restriction enzymes above so far as each polymorphism in 
the specific regions l)-8) above can be detected. 

Preferably, identification methods of the present 

invention comprise: 

i) preparing a pair of primers based on a base 

sequence of adjacent regions including any one of the 
following base; 

1) a base corresponding to the base 45461; or 

2) a base corresponding to the base 49609; 

to amplify both the base of the above and adjacent regions 
thereto; 

ii) performing nucleic acid amplification reaction(s) 
using the genome DNA of the subject rice individual or the 
seed thereof as a template; and 

iii) discerning the presence of the Rf-1 in the 
subject rice individual or the seed thereof based on 
polymorphism found in said nucleic acid amplification 
product. The subject rice individual or seed thereof is 
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determined to have the Rf-1 gene in step iii), although not 
limited to, when at least one of the following requirements 
1) and 2) is met: 

1) a region including a base corresponding to the base 
5 45461 of SEQ ID NO: 27 does not have any TaqI recognition 

sequence; 

2) a region including a base corresponding to the base 
49609 of SEQ ID NO: 27 does not have any BstUI recognition 
sequence. 

10 

The base 45461 of SEQ ID NO: 27 discussed above 
corresponds to 1) the base 1769 of SEQ ID NO. 69; 2) the 
base 1767 of SEQ ID NO. 70; 3) the base 1772 of SEQ ID 
NO. 71; 4) the base 1762 of SEQ ID NO. 72; 5) the base 1703 
15 of SEQ ID NO. 73; 6) the base 1779 of SEQ ID NO. 74; 7) the 
base 1783 of SEQ ID NO. 80; 8) the base 1729 of SEQ ID 
NO. 81; 9) the base 1781 of SEQ ID NO. 82; 10) the base 1774 
of SEQ ID NO. 83; 11) the base 1728 of SEQ ID NO. 84; and 12) 
the base 1644 of SEQ ID NO. 85. 

20 

Primer pairs used for the amplification reaction can 
be appropriately selected by those skilled in the art to 
preferably satisfy the conditions above on the basis of the 
base sequence of SEQ ID NO: 27. Preferably, any primer pair 
25 having a base sequence selected from the group consisting 
of SEQ ID NOS: 39 and 40, SEQ ID NOS : 41 and 42, SEQ ID 
NOS: 43 and 44, SEQ ID NOS: 45 and 46, SEQ ID NOS: 47 and 
48, SEQ ID NOS: 49 and 50, SEQ ID NOS: 51 and 52, and SEQ 



ID NOS: 53 and 54 is used. More preferably, the primer 
pair is selected from the group consisting of SEQ ID NOS: 
45 and 46, and SEQ ID NOS: 47 and 48. If necessary, the 
sequences of the above primer pairs containing 
5 substitutions, deletions or additions while retaining the 
binding specificity for the base sequence of the 
polymorphic site and adjacent regions thereto can also be 
used as primers . 

10 To examine the resulting PCR product for restriction 

fragment length polymorphisms, it is cleaved with 
restriction enzymes corresponding to the restriction sites 
present in PCR markers. Such cleavage is accomplished by 
incubation for several hours to a day at the recommended 

15 reaction temperature for the restriction enzymes used. The 
PCR amplified sample cleaved with the restriction enzymes 
can be analyzed by electrophoresis on an about 0.7% - 2% 
agarose gel or an about 3% MetaPhor™ agarose gel. The gel 
is visualized under UV light in ethidium bromide, for 

20 example. 

In the most preferred embodiments of the present 

invention, restriction enzyme cleavage patterns show the 

bands as shown in Table 2 below on the visualized gel 
25 depending on the primer pair used. 
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Table 2 

Approximate size 
(bp) of detected band 
Amplified with P4497 MobI (SEQ ID NOS: 39 and 40) 
5 Restriction enzyme Mbol 

Test rice genome having the Rf-1 gene (homozygous): 730 

no : 385 # 345 

Amplified with P9493 BslI (SEQ ID NOS: 41 and 42) 
Restriction enzyme BslI 
10 Test rice genome having the Rf-1 gene (homozygous): 126 

no : 100 , 26 

Amplified with P23945 Mbol (SEQ ID NOS: 43 and 44) 
Restriction enzyme Mbol 

Test rice genome having the Rf-1 gene (homozygous): 
15 160, 100 

no : 260 

Amplified with P41030 TaqI (SEQ ID NOS: 45 and 46) 
Restriction enzyme TaqI 

Test rice genome having the Rf-1 gene (homozygous): 280 

20 no : 90, 190 

Amplified with P45177 BstUI (SEQ ID NOS: 47 and 48) 
Restriction enzyme BstUI 

Test rice genome having the Rf-1 gene (homozygous): 

20,65,730 

25 no : 20,65,175,555 

Amplified with B60304 Mspl (SEQ ID NOS: 49 and 50) 
Restriction enzyme Mspl 
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Test rice genome having the Rf-1 gene (homozygous): 330 

no ; 220, 110 

Amplified with B59066 BsaJI (SEQ ID NOS: 51 and 52) 
Restriction enzyme BsaJI 

Test rice genome having the Rf-1 gene (homozygous): 420 

no : 65, 355 

Amplified with B56691 Xbal (SEQ ID NOS: 53 and 54) 
Restriction enzyme Xbal 

Test rice genome having the Rf-1 gene (homozygous): 670 
no : 140 , 530 

In Example 3 below, recombinants proximal to the Rf-1 
gene having pollen fertility (RS1-RS2, RC1-RC8) were tested 
for the chromosomal organization of the Rf-1 region using 
14 polymorphic markers including the 8 primer pairs 
described above. As a result, it was confirmed that all 
the plants carry the Rf-1 gene derived from the indica 
variety between P9493 BslI and 59066 BsaJI, This result 
showed that recombinant pollens having the chromosomal 
organization as shown in Fig. 3 have pollen fertility, i.e. 
the Rf-1 gene is functional in these pollens. This means 
that a sequence determining the presence of the function of 
the Rf-1 gene is included in the indica region common to 
these recombinant pollens, i.e. in a region from the P4497 
Mbol to B56691 Xbal loci (about 65 kb) as estimated at 
maximum . 

In the present invention, chromosomal walking was 
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started on the presumption that the S12564 Tsp509I locus 
should be vary proximal to the Rf-1 locus as judged from 
the frequency of appearance of individuals by crossing. In 
fact, the genetic distance between both loci has been 
5 calculated to be about 0.04 cM as the result of the high- 
precision segregation analysis of the present invention. 
Even one of markers known to be most closely linked to the 
Rf-1 locus as described in Japanese Patent Public 
Disclosure No. 2000-139465 is reported to have a genetic 

10 distance of 1 cM from the Rf-1 locus. Considering that 1 
cM is estimated to be equivalent to 300 kb on average in 
rice, a considerable time should be required to restrict 
the Rf-1 gene region if chromosomal walking were started 
from the marker described in Japanese Patent Public 

15 Disclosure No. 2000-139465. 

VI, Method f or inhibiting the function of Rf-1 gene to 
restore fertility 

According to the present invention, the nucleic acid 

20 containing the locus of a fertility restorer gene (Rf-1) 
including the nucleic acids which function to restore 
fertility was isolated. The entire base sequence thereof 
was determined, whereby the fertility restoring function of 
the Rf-1 gene can be controlled by genetic engineering 

25 techniques. Thus, the present invention further provides a 
method for inhibiting the function of Rf-1 to restore 
fertility. 
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A method for inhibiting the function of the Rf-1 gene 
to restore fertility according to one embodiment of the 
present invention comprises introducing an antisense having 
at least 100 continuous bases in length, and having a base 
5 sequence complementary to a nucleic acid having the base 
sequence of SEQ ID NO .27, or to a nucleic acid having a 
base sequence which is identical to at least 70% of the 
base sequence of SEQ ID NO .27, and which functions to 
restore fertility . 

10 

In an embodiment, the method for inhibiting the 
function of the Rf-1 gene to restore fertility according to 
the present invention comprises introducing an antisense 
having at least 100 continuous bases in length, and being 

15 selected from base sequences complementary to a nucleic 
acid having the base sequence of the bases 38538-54123, 
preferably the bases 42357-53743, more preferably the bases 
42132-48883 of SEQ ID NO: 27, or to a nucleic acid having a 
base sequence which is identical to at least 70% of the 

20 base sequence of the bases 38538-54123, preferably the 

bases 42357-53743, more preferably the bases 42132-48883, 
still more preferably the bases 42132-46318 of SEQ ID NO: 27 
and, which functions to restore fertility. 

25 In an especially preferable embodiment, the method for 

inhibiting the function of the Rf-1 gene to restore 
fertility according to the present invention comprises 
introducing an antisense having at least 100 bases in 
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length, and being selected from base sequences 
complementary to a nucleic acid encoding the amino acid 
sequence of SEQ ID NO. 75, or an amino acid sequence which 
is identical to at least 70% of the amino acid sequence of 
5 SEQ ID NO. 75, and which functions to restore fertility. 

Most preferably, the nucleic acid encoding the amino 
acid sequence of SEQ ID NO. 7 5 , or an amino acid sequence 
which is identical to at least 70% of the amino acid 
10 sequence of SEQ ID NO. 75 is selected from nucleic acids of 
the following a) - p) : 

a) a nucleic acid comprising the bases 215-2587 of 
SEQ ID NO: 69; 

b) a nucleic acid comprising the bases 213-2585 of 
15 SEQ ID NO: 70; 

c) a nucleic acid comprising the bases 218-2590 of 
SEQ ID NO: 71; 

d) a nucleic acid comprising the bases 208-2580 of 
SEQ ID NO: 72; 

20 e) a nucleic acid comprising the bases 149-2521 of 

SEQ ID NO: 73; 

f ) a nucleic acid comprising the bases 225-2597 of 
SEQ ID NO: 74; 

g) a nucleic acid comprising the bases 43907-46279 of 
25 SEQ ID NO: 27; 

h) a nucleic acid comprising the bases 229-2601 of 
SEQ ID NO: 80; 

i) a nucleic acid comprising the bases 175-2547 of 



SEQ ID NO:81; 

j) a nucleic acid comprising the bases 227-2599 of 
SEQ ID NO: 82; 

k) a nucleic acid comprising the bases 220-2592 of 
5 SEQ ID NO: 83; 

1) a nucleic acid comprising the bases 174-2546 of 
SEQ ID NO: 84; 

m) a nucleic acid comprising the bases 90-2462 of SEQ 
ID NO:85; 

10 n) a nucleic acid which is identical to at least 70% 

of the nucleic acid of any of a) - m) , and which functions 
to restore fertility; 

o) a nucleic acid which hybridizes to the nucleic 
acid of any of a) - m) under a moderate or high stringent 

15 condition, and which functions to restore fertility; and 
p) a nucleic acid wherein one or a plurality of 
base(s) is deleted from, added to or substituted from the 
nucleic acid of any of a) - m) , and which functions to 
restore fertility. 

20 

The antisense has a length of at least 100 bases or 
more, more preferably 500 bases or more, most preferably 
1000 bases or more. From the viewpoint of technical 
convenience of introduction, it preferably has a length of 
25 10000 bases or less, more preferably 5000 bases or less. 
The antisense can be synthesized by known methods. The 
antisense can be introduced into rice by known methods as 
described in e.g. Terada et al. (Plant Cell Physiol. 2000 



Jul, 41(7), pp. 881-888). 

It is also anticipated that Rf-1 disrupted lines can 
be established by screening variant lines containing a 
5 transposable element such as, but not limited to, Tosl7 
(Hirochika H. et al. 1996, Proc. Natl. Acad. Sci. USA 93, 
pp. 7783-7788) for a line containing the transposable 
element in the base sequence of SEQ ID NO: 27. In plants, 
gene disruption by homologous recombination has been 

10 studied. It may also be possible to inhibit fertility 

restoring function by establishing such a line in which the 
Rf-1 gene has been replaced by a variant Rf-1 gene using a 
nucleic acid having the base sequence of SEQ ID NO. 27, or a 
nucleic acid having a base sequence which is identical to 

15 at least 70% of the base sequence of SEQ ID NO. 27. 
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Exam p le s 

The following examples further illustrate the present 
invention but are not intended to limit the technical scope 
of the invention. Those skilled in the art can readily add 
modifications /changes to the present invention on the basis 
of the description of the specification, and those 
modifications/changes are included in the technical scope 
of the present invention. 

Reference examples 

The following reference examples are based on the 
examples described in our prior application (Japanese 
Patent Application No. 2000-247204 filed August 17, 2000). 

Reference example 1: Conversion of RFLP markers around Rf-1 
gene to P CR markers 

In this reference example, nine RFLP markers (i.e., 
R1877, G291, R2303, S12564, C1361, S10019, G4003, S10602 
and G2155) around the locus of Rf-1 gene were converted to 
PCR markers . 

( 1 ) Materials and methods 

The following nine RFLP markers, R1877, G291, R2303, 
S12564, C1361, S10019, G4003, S10602 and G2155, were 
purchased from the National Institute of Agrobiological 
Sciences, the Ministry of Agriculture, Forestry and 
Fisheries of Japan. After determining the base sequences 
of the inserts in the vectors, experiments were conducted 
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according to the following procedures. Among rice 
varieties herein, Asominori belongs to japonica, and IR24 
belongs to indica. 

5 (2) Preparation of Asominori genomic library 

Total DNA was extracted from green leaves of Asominori 
by the CTAB method. After partial digestion with Mbol , the 
DNA was fractionated according to size by NaCl density 
gradient centrif ugation (6-20% linear gradient, 20°C # 

10 37,000 rpm, 4 hr, total volume = 12 mL) . A portion of each 
fraction (about 0.5 mL) was subjected to electrophoresis 
and fractions containing 15-20 kb DNA were collected and 
purified. A library was constructed using Lambda DASH II 
(Stratagene) as a vector in accordance with the attached 

15 protocol. Giga Pack III Gold (Stratagene) was used for 

packaging. After packaging, 500 \iL of SM Buffer and 20 \xL 
of chloroform were added. After centrif ugation , 20 [iL of 
chloroform was added to the supernatant to make a library 
solution . 

20 

XL-1 Blue MRA (P2) was infected with 5 [iL of a 50-fold 
dilution of the library solution, whereupon 83 plaques were 
formed. This corresponded to 4.15 x 10 5 pfu per library, 
and hence, it was calculated that the plaques covered 8.3 x 
25 lO 9 bp assuming that the average length of the inserted. 

fragments was 20 kb. The library was therefore considered 
to have an adequate size for the rice genome (4 x 10 8 bp). 
(3) Isolation of genomic clones containing R1877-, C1361- 



and G400 3 -marker regions. 

As for C1361 and G4003, plasmids containing the RFLP 
marker probe were isolated and subjected to restriction 
5 enzyme treatment and electrophoresis to separate the RFLP 
marker probe portion; the desired DNA was recovered on a 
DNA recovery filter (Takara SUPREC-01). As for R1877, 
primers were designed that were specific to both ends of 
the marker probe and PCR was performed with the total DNA 

10 of Asominori used as a template; the amplification products 
were electrophoresed and recovered by the method described 
above. The recovered DNA was labelled with a Rediprime DNA 
Labelling System (Amersham Pharmacia) to prepare a probe 
for screening the library. PCR was performed in the usual 

15 manner (this also applies to the following description). 

Screening of the library was performed in the usual 
manner after blotting the plaques onto Hybond-N+ (Amersham 
Pharmacia). After primary screening, areas of positive 
20 plaques were individually punched out, suspended in SM 
buffer and subjected to the second round of screening. 
After the second screening, the positive plaques were 
punched out and subjected to the third round of screening 
to isolate a single plaque. 

25 

The isolated plaque of interest was suspended in SM 
buffer and primary multiplication of the phage was 
performed by the plate lysate method. The resulting phage- 
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enriched solution was subjected to secondary multiplication 
by shake culture and the phage DNA was purified with Lambda 
starter kit (QIAGEN) . 

5 For each marker, primary screening was conducted on 

eight plates. A 10 [iL aliquot of the library solution was 
employed per plate. After the primary, second and third 
rounds of screening, four genomic clones in association 
with R1877 were isolated and three were isolated in 
10 association with each of C1361 and G4003. 



(4) Conversion of R1877 to PCR marker 

The isolated genomic clones were analyzed to identify 
the causative site of RFLP , or the EcoRI site that exists 
15 in IR24 (indica rice) but not in Asominori (japonica rice), 
thereby converting R187 7 to a PCR marker. 



Specifically, the four isolated clones were subjected 
to the following analyses. First, T3 and T7 primers were 

20 used to determine the base sequences at both ends of the 
insert in each clone. Then, primers extending outwardly 
from both ends of the marker probe were designed. They 
were combined with T3 and T7 primers to give a combination 
of four primers in total, and employed in PCR with each 

25 clone used as the template. 



In a separate step, each clone was digested with NotI 
and EcoRI, and electrophoresed to estimate the insert size 
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and the length of each EcoRI fragment. 

These analyses revealed the relative positions of the 
individual clones. In RFLP analysis, marker probe R1877 
was reported to detect an EcoRI fragment of 20 kb in 
Nipponbare (japonica rice) and one of 6 . 4 kb in Kasalath 
(indica rice) 

(ftp : / /ftp . staff . or . jp/pub/geneticmap9 8/parentsouthern/chrl 
0/R1877 . JPG) . This fact, taken together with the results 
of analysis described above, gave a putative position for 
the EcoRI site that existed in IR24 but not in Asominori. 
Hence, a primer combination (SEQ ID NO:l x SEQ ID NO: 2) 
that was designed to amplify the nearby region was employed 
to perform genomic PCR over 30 cycles, each cycle 
consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 2 min. 
The PCR product was treated with EcoRI and subjected to 
electrophoresis on 0.7% agarose gel. 

As a result, the expected polymorphisms were observed 
between Asominori and IR24. By treatment with EcoRI, the 
PCR product (-3200 bp) was cleaved to yield 1500 bp and 
1700 bp fragments in IR24 but not in Asominori. Mapping of 
the marker was made with an RIL (recombinant inbred line) 
of Asominori- IR24 with the results that the PCR marker was 
located in the same region as that of RFLP marker locus 
R1877, thereby confirming the conversion of RFLP marker 
R1877 to a PCR marker, which was named R187 7 EcoRI in the 
present invention . 



(5) Conversion of G4003 to PCR marker 

The isolated genomic clones were analyzed to identify 
the causative site of RFLP, or the Hindlll site that 
existed in Asominori but not in IR24, thereby converting 
5 G4003 to a PCR marker. 

By performing analyses similar to those employed for 
R1877, the relative positions of the three isolated clones 
were revealed. In RFLP analysis, marker probe G4003 was 

10 reported to detect a Hindi I I fragment of 3 kb in Nipponbare 
(japonica rice) and one of 10 kb in Kasalath (indica rice) 
(ftp: //ftp. staff . or . jp/pub/genet icmap98 /parent southern/chrl 
0/R1877. JPG) . This report, taken together with the 
analyses described above, led to a temporary conclusion 

15 that the Hindlll site that existed in Asominori but not in 
IR24 would be at either one of two candidate sites. Hence, 
a primer combination (SEQ ID NOS: 3 and 4) that was 
designed to amplify the area in the neighborhood of each 
Hindlll site was employed to perform genomic PCR over 35 

20 cycles, each cycle consisting of 94°C x 30 sec, 58°C x 30 
sec and 72°C x 30 sec. The PCR product was treated with 
Hindlll and subjected to electrophoresis on 2% agarose gel. 
As a result, the Hindlll site within the marker probe was 
found to have polymorphisms. By treatment with Hindlll, 

25 the PCR product (362 bp) was cleaved to yield a 95 bp 
fragment and a 267 bp fragment in Asominori but not in 
IR24. Mapping of the site demonstrated the conversion of 
RFLP marker G4003 to a PCR marker, which was named G4003 



Hindlll (SEQ ID NO: 19) in the present invention. 

(6) Conversion of C1361 to PCR marker 

Primers were designed on the basis of the base 
5 sequence information of the isolated genomic clones . PCR 
was performed with the total DNAs of Asominori and IR24 
being used as a template and the PCR product was recovered 
by known methods after electrophoresis. Using the 
recovered DNA as a template, the inventors analyzed the 
10 base sequence of each of the rice varieties with ABI Model 
310 in search of mutations that would cause polymorphisms. 

By performing analyses similar to those employed for 
R1877, approximate relative positions of the three isolated 

15 clones could be established. As it turned out, however, 
regions around the C1361 marker would be difficult to 
amplify by PCR or determine their base sequences, and 
hence, it would not be easy to identify the causative site 
of RFLP. Hence, the inventors took notice of the region 

20 capable of yielding a comparatively long PCR product (2.7 
kb) and made an attempt to create a dCAPS marker. 

Specifically, upon comparing the base sequences of the 
genomic PCR products of said region using Asominori and 
25 Koshihikari (both japonica rice) and Kasalath and IR24 
(both indica rice), the inventors found six sites of 
polymorphism between japonica and indica. One of these six 
sites was used to create a dCAPS marker. To this end, with 
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SEQ ID NO: 5 and SEQ ID NO: 6 used as primers, PCR was 
performed over 35 cycles, each cycle consisting of 94°C x 
30 sec, 58°C x 30 sec and 72°C x 30 sec. The PCR product 
was treated with Mwol and analyzed by electrophoresis on 3% 
5 MetaPhor™ agarose gel. In Asominori, cleavage occurred at 
two sites to give three observable bands of about 2 5 bp, 
50 bp and 79 bp, but in IR24 cleavage occurred at one site 
to give two observable bands of about 50 bp and 107 bp. 
Mapping demonstrated the conversion of RFLP marker C1361 to 
10 a PCR marker, which was named C1361 Mwol (SEQ ID NO: 20) in 
the present invention. 

(7) Conversion of G2155 to PCR marker 

Primers specific to both ends of the marker probe were 
15 designed and PCR was performed with the total DNA of 

Asominori, Koshihikari, IR24 or IL216 (a line produced by 

introducing Rf-1 gene into Koshihikari by back crossing; 

its genotype was Rf-l/Rf-1) being used as a template. 

Purification of the PCR product and searching for a 
20 mutation that would be useful for providing restriction 

fragment polymorphisms were performed by the methods 

already described above. 

Specifically, as a result of comparing the base 
25 sequences of corresponding regions of the varieties under 
test, mutations were found at three sites between the 
variety/line (IR24 and IL216) having Rf-1 gene and the 
variety (Asominori and Koshihikari) not having Rf-1 gene. 
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One of the three sites was utilized to create a dCAPS 
marker. To this end, SEQ ID NO: 7 and SEQ ID NO: 8 were used 
as primers to perform PCR over 35 cycles, each cycle 
consisting of 94°C x 30 sec, 58°C x 30 sec and 72°C x 30 
5 sec. The PCR product was treated with Mwol and analyzed by 
electrophoresis on 3% MetaPhor™ agarose gel; In Asominori, 
cleavage occurred at one site to give two observable bands 
of about 2 5 bp and 10 5 bp, but in IR24, cleavage occurred 
at two sites to give three observable bands of about 2 5 bp, 
10 27 bp and 78 bp. Mapping demonstrated the conversion of 
RFLP marker G2155 to a PCR marker, which was named G2155 
Mwol (SEQ ID NO: 21) in the present invention. 

(8) Conversion of G291 to PCR marker 

15 Primers specific to internal sequences of the marker 

probe were designed and used in various combinations to 
perform PCR to find a primer combination that could yield 
an amplif ication product of the expected size. Using the 
selected primer combination, the inventors performed PCR 

20 with the total DNA of Asominori, Koshihikari, IR24 and 

IL216 used as a template. Purification of the PCR product 
and searching for a mutation that could be utilized in 
providing restriction fragment polymorphisms were performed 
by the methods already described above. 

25 

Specifically, using the primers designed to be 
specific for the marker probe sequence, the inventors 
performed genomic PCR of each variety under test and 



compared the base sequences of the products. As a result, 
mutations were found at four sites between the variety/line 
having Rf-1 gene (IR24 and IL216) and the variety 
(Asominori and Koshihikari) not having Rf-1 gene. One of 
5 the four sites was used to create a dCAPS marker. To this 
end, SEQ ID NO:9 and SEQ ID NO: 10 were used as primers to 
perform PCR over 35 cycles, each cycle consisting of 94°C x 
30 sec, 58°C x 30 sec and 72°C x 30 sec. The PCR product 
was treated with Mspl and analyzed by electrophoresis on 3% 

10 MetaPhor™ agarose gel. In the varieties /lines having Rf-1 
gene, cleavage occurred at two sites to give three 
observable bands of about 25 bp, 49 bp and 5 5 bp, but in 
the varieties not having Rf-1 gene, cleavage occurred at 
one site to give two observable bands of about 25 bp and 

15 104 bp. Mapping demonstrated the conversion of RFLP marker 
G291 to a PCR marker, which was named G291 Mspl (SEQ ID 
NO: 22) in the present invention. 

(9) Conversion of R2303 to PCR marker 

20 Primers specific to internal sequences of the marker 

probe were designed and PCR was performed with the total 
DNA of Asominori (japonica rice) and IR24 and Kasalath 
(indica rice) used as a template. Purification of the PCR 
product and searching for a mutation that could be used for 

25 providing restriction fragment polymorphisms were performed 
by the methods already described above. 



As a result of comparing the base sequences of 
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corresponding regions of the varieties under test, a 
mutation was found between japonica rice and indica rice. 
Since the mutation occurred at the BslI recognition site, 
the site was directly used to create a CAPS marker. To 
5 this end, SEQ ID NO: 11 and SEQ ID NO: 12 were used as 

primers and PCR was performed over 30 cycles, each cycle 
consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 2 min . 
The PCR product was treated with BslI and analyzed by 
electrophoresis on 2% agarose gel. In japonica rice, 

10 cleavage occurred at one site to give two observable bands 
of about 238 bp and 1334 bp, but in indica rice, cleavage 
occurred at two sites to give three observable bands of 
about 238 bp, 655 bp and 679 bp. Mapping demonstrated the 
conversion of RFLP marker R2303 to a PCR marker, which was 

15 named R2303 BslI (SEQ ID NO: 23) in the present invention. 

(10) Converting S10019 to PCR marker 

S10019 was converted to a PCR marker in accordance 
with the method (9) of converting R2303 to a PCR marker. 

20 

Specifically, as a result of comparing the base 
sequences of corresponding regions of the varieties under 
test, a mutation was found between japonica rice and indica 
rice. Since the mutation occurred at the BstUI recognition 
25 site, the site was directly used to create a CAPS marker. 
To this end, SEQ ID NO: 13 and SEQ ID NO: 14 were used as 
primers and PCR was performed over 30 cycles, each cycle 
consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 1 min. 
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The PCR product was treated with BstUI and analyzed by 
electrophoresis on 2% agarose gel. In japonica rice, 
cleavage occurred at one site to give two observable bands 
of about 130 bp and 462 bp, but in indica rice, cleavage 
5 occurred at two sites to give three observable bands of 

about 130 bp, 218 bp and 244 bp. Mapping demonstrated the 
conversion of RFLP marker S10019 to a PCR marker, which was 
named S10019 BstUI ( SEQ ID NO: 24) in the present invention. 

10 (11) Conversion of S10602 to PCR marker 

S10602 was converted to a PCR marker in accordance 
with the method (9) of converting R23 03 to a PCR marker. 

Specifically, as a result of comparing the base 
15 sequences of corresponding regions of the varieties under 

test, a mutation was found between japonica rice and indica 
rice. The mutation was used to create a CAPS marker. To 
this end, SEQ ID NO: 15 and SEQ ID NO: 16 were used as 
primers and PCR was performed over 33 cycles, each cycle 
20 consisting of 94°C x 1 min, 58°C x 1 min and 72°C x 1 min. 
The PCR product was treated with Kpnl and analyzed by 
electrophoresis on 2% agarose gel. In japonica rice, 
cleavage occurred at one site to give two observable bands 
of about 117 bp and 607 bp, but in indica rice, no cleavage 
25 occurred, giving only an observable band of 724 bp. 

Mapping demonstrated the conversion of RFLP marker S10602 
to a PCR marker, which was named S10602 Kpnl (SEQ ID NO: 25) 
in the present invention. 
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(12) Conversion of S12564 to PCR marker 

S12564 was converted to a PCR marker in accordance 
with the method of converting R2303 to a PCR marker. 



5 Specifically, as a result of comparing the base 

sequences of corresponding regions of the varieties under 
test, a mutation was found between japonica rice and indica 
rice. The mutation was used to create a dCAPS marker. To 
this end, SEQ ID NO: 17 and SEQ ID NO: 18 were used as 

10 primers and PCR was performed over 35 cycles, each cycle 
consisting of 94°C x 30 sec, 58°C x 30 sec and 72°C x 30 
sec. The PCR product was treated with Tsp509I and analyzed 
by electrophoresis on 3% MetaPhor™ agarose gel. In 
japonica rice, cleavage occurred at two sites to give three 

15 observable bands of 26 bp, 41 bp and 91 bp, but in indica 
rice, cleavage occurred at one site to give two observable 
bands of 41bp and 117 bp. Mapping demonstrated the 
conversion of RFLP marker S12564 to a PCR marker, which was 
named S12564 Tsp509I (SEQ ID NO: 26) in the present 

20 invention. 

Reference example 2: Mapping of Rf-1 Gene Locus 

DNA was extracted from 104 2 seedlings of the Fl 
population produced by pollinating MS Koshihikari with MS- 
25 FR Koshihikari, and the DNA extract was used in the 

analysis. MS Koshihikari (generation: BC10F1) was created 
by replacing the cytoplasm of Koshihikari with BT type male 
sterility cytoplasm. MS-FR Koshihikari was a line created 
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by introducing Rf-1 gene from IR8 (supplied from National 
Institute of Agrobiological Sciences) into MS Koshihikari 
(the locus of Rf-1 gene being heterozygous). 

5 First, each individual was investigated for the 

genotype at two marker loci R1877 EcoRI and G2155 Mwol 
described in Reference example 1 that would presumably be 
located on opposite sides of the locus of Rf-1 gene. 
Japonica type homozygotes with respect to either locus 

10 R1877 EcRI or G2155 Mwol were regarded as recombinants 
between these two marker loci. Then, each of such 
recombinants was investigated for the genotypes of G291 
Mspl, R2303 BslI, S12564 Tsp 5091, C1361 Mwol, S10019 
BstUI, G4003 Hindlll and S10602 Kpnl loci, and the 

15 positions of recombination were identified. 

The genotype investigation with respect to R1877 EcoRI 
and G2155 Mwol loci revealed that 46 individuals were 
recombinants around the locus of Rf-1 gene. Genotypes of 
20 the marker loci around the locus of Rf-1 gene were 
investigated and the results are shown in Table 3. 
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Table 3. Genotypes of Marker Loci in Recombinant 
Individuals Around Rf-1 Locus 
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J: Homozygous Koshihikari type 

H: Heterozygous Koshihikari type/ MS-FR Koshihikari type 

5 

As shown in Table 3, recombinant 8 homozygous for 
japonica at the S1256 4 Tsp509I marker locus and 
recombinants 9 and 10 homozygous for japonica at the C1361 
Mwo marker locus were obtained. As all of these 

10 recombinants restored fertility, the former was regarded as 
a recombinant between the Rf-1 and S12564 Tsp509I loci 
while the latter were regarded as recombinants between the 
Rf-1 and C1361 Mwol loci, showing that the Rf-1 gene is 
located between the S12564 Tsp509I and C1361 Mwol loci. 

15 Based on the report that only pollens carrying the Rf-1 

gene have fertility in individuals having the BT type male 
sterile cytoplasm in the cross above (C.Shinjyo, 
JAPAN. J. GENETICS Vol.44, No . 3 : 149-156 ( 1969 )) , the Rf-1 gene 
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locus could be located on a detailed linkage map (Fig. 4). 

Example 1; Acquisition of recombinant individuals proximal 
to the Rf-1 locus 
5 (Materials and Methods) 

DNA was extracted from each of 4103 individuals of 
BC10F1 population produced by pollinating MS Koshihikari 
(generation: BC10F1) with MS-FR Koshihikari (generation: 
BC9F1, heterozygous at the Rf-1 locus), and genotyped at 

10 the S12564 Tsp509I and C1361 Mwol loci in the same manner 
as described in Reference example 2 above. Individuals 
having a genotype homozygous for Koshihikari at the S12564 
Tsp509I locus were regarded as those generated by 
recombination between the Rf-1 and S12564 Tsp509I loci, 

15 while individuals having a genotype homozygous for 

Koshihikari at the C1361 Mwol locus were regarded as those 
generated by recombination between the Rf-1 and C1361 Mwol 
loci. 

20 (Results and Discussion) 

A survey of 4103 individuals revealed one recombinant 
individual between the Rf-1 and S12564 Tsp509I loci and 6 
recombinant individuals between the Rf-1 and C1361 Mwol 
loci. The previous survey of 1042 individuals obtained by 

25 crossing in Reference example 2 above had already revealed 
one recombinant individual between the Rf-1 and S12564 
Tsp509I loci and 2 recombinant individuals between the Rf-1 
and C1361 Mwol loci as shown in Table 3. 
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Thus, a total of 2 recombinant individuals between the 
Rf-1 and S12564 Tsp509I loci and 8 recombinant individuals 
between the Rf-1 and C1361 Mwol loci were able to be 
oabtained from 5145 individuals. These 10 individuals were 
5 tested by high-precision segregation analysis in the 
examples below. 

Example 2 chromosomal walking 

(1) First chromosomal walking 
10 (Materials and Methods) 

A genomic library was constructed from the genomic DNA 

of Asominori japonica (not carrying Rf-1) using Lambda DASH 

II vector as described in Reference example 1 and tested by 

chromosomal walking. 
15 PCR was routinely performed using total DNA of 

Asominori as a template in combination with the following 

primer pair : 

5 ' -atcaggagccttcaaattgggaac-3 ' (SEQ ID NO: 29) and 
5 ' -ctcgcaaattgcttaattttgacc-3 ' (SEQ ID NO: 30) 
20 designed for a partial base sequence (Accession No. D47284) 
of RFLP probe S12564. The resulting amplification products 
of about 1200 bp were electrophoresed on an agarose gel and 
then purified by QIAEXII (QIAGEN) . The purified DNA was 
labeled with a rediprime DNA labelling system (Amersham 
2 5 Pharmacia) to give a library screening probe (probe A, Fig. 
1). 



The library was routinely screened after plaques were 
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blotted onto Hybond-N + (Amersham Pharmacia). Single 
plaques were separated, after which phage DNA was purified 
by the plate lysate method using Lambda Midi kit (QIAGEN) . 

(Results and Discussion) 

The results of terminal base sequence analysis and 
restriction enzyme fragment length analysis showed that two 
(WSA1 and WSA3) of 4 clones obtained by screening were in a 
relative position as shown in Fig. 1, The Asominori 
genomic base sequences corresponding to WSA1 and WSA3 were 
determined by primer walking (DNA Sequencer 377 , ABI). 

(2) Second chromosomal walking 
(Materials and Methods) 

In addition to the Asominori genomic library described 
above, an IR24 genomic library was similarly constructed 
from the genomic DNA of an indica variety IR24 (carrying 
Rf-1) and tested by chromosomal walking. 

PCR was routinely performed using DNA of WSA3 as a 
template in combination with the following primer pair: 
5'-tgaaggagttatgggtgcgtgacg-3' (SEQ ID N0:31) and 
5' -ttgccgagcacacttgccatgtgc-3' (SEQ ID NO: 32) 
designed for the Asominori genomic base sequence determined 
in (1). The resulting amplification products of 524 bp 
were purified and labeled by the method described above to 
give a library screening probe (probe E # Fig. 1). 
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Library screening and phage DNA purification were 
performed by the method described above. 

(Results and Discussion) 
5 The results of terminal base sequence analysis and 

restriction enzyme fragment length analysis showed that one 
(WSE8) of 15 clones obtained by screening of the Asominori 
genomic library was in a relative position as shown in Fig. 
1 . The Asominori genomic base sequence corresponding to 
10 WSE8 was determined by primer walking. 

The results of terminal base sequence analysis and 
restriction enzyme fragment length analysis showed that two 
(XSE1 and XSE7) of 7 clones obtained by screening of the 
15 IR24 genomic library were in a relative position as shown 
in Fig. 1. The IR24 genomic base sequences corresponding 
to XSE1 and XSE7 were determined by primer walking. 

(3) Third chromosomal walking 
20 (Materials and Methods) 

The Asominori genomic library and IR2 4 genomic library 
described above were tested by chromosomal walking. 

PCR was routinely performed using DNA of WSE8 as a 
25 template in combination with the following primer pair: 
5 ' -gcgacgcaatggacatagtgctcc-3 ' (SEQ ID NO: 33) and 
5 9 -ttacctgccaagcaatatccatcg-3 ' (SEQ ID NO: 34) 
designed for the Asominori genomic base sequence determined 
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in (2). The resulting amplification products of 1159 bp 
were purified and labeled by the method described above to 
give a library screening probe (probe F, Fig. 1). 

Library screening and phage DNA purification were 
performed by the method described above. 

(Results and Discussion) 

The results of terminal base sequence analysis and 
restriction enzyme fragment length analysis showed that two 
(WSF5 and WSF7) of 8 clones obtained by screening of the 
Asominori genomic library were in a relative position as 
shown in Fig. 1. The Asominori genomic base sequences 
corresponding to WSF5 and WSF7 were determined by primer 
walking . 

The results of terminal base sequence analysis and 
restriction enzyme fragment length analysis showed that two 
(XSF4 and XSF20) of 13 clones obtained by screening of the 
IR2 4 genomic library were in a relative position as shown 
in Fig. 1. The IR24 genomic base sequences corresponding 
to XSF4 and XSF20 were determined by primer walking. 

(4) Fourth chromosomal walking 
(Materials and Methods) 

The Asominori genomic library and IR2 4 genomic library 
described above were tested by chromosomal walking. 
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PCR was routinely performed using DNA of WSF7 as a 
template in combination with the following primer pair: 
5 ' -aaggcatactcagtggagggcaag-3 ' (SEQ ID NO: 35) and 
5 ' -ttaacctgaccgcaagcacctgtc-3 ' (SEQ ID NO: 36) 
5 designed for the Asominori genomic base sequence determined 
in (3). The resulting amplification products of 456 bp 
were purified and labeled by the method described above to 
give a library screening probe (probe G, Fig. 1). 

10 Library screening and phage DNA purification were 

performed by the method described above. 

(Results and Discussion) 

The results of terminal base sequence analysis and 

15 restriction enzyme fragment length analysis showed that two 
(WSG2 and WSG6 ) of 6 clones obtained by screening of the 
Asominori genomic library were in a relative position as 
shown in Fig. 1. The Asominori genomic base sequences 
corresponding to WSG2 and WSG6 were determined by primer 

20 walking. 

The results of terminal base sequence analysis and 
restriction enzyme fragment length analysis showed that 
three (XSG8 f XSG16 and XSG22) of 14 clones obtained by 
25 screening of the IR24 genomic library were in a relative 
position as shown in Fig. 1. The IR24 genomic base 
sequences corresponding to XSG8, XSG16 and XSG22 were 
determined by primer walking. 
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(5) Fifth chromosomal walking 
(Materials and Methods) 

The IR24 genomic library described above was tested by 
chromosomal walking. 

5 

We perused the public website of TIGR (The Institute 
for Genomic Research) and found that a BAC (Bacterial 
Artificial Chromosome) clone (Accession No. AC068923) 
containing RFLP marker S12564 had been deposited with a 
10 public database (GenBank) . This BAC clone contains the 
genomic DNA of Nipponbare japonica and it was shown from 
base sequence comparison to completely include the contig 
regions of Asominori and IR24 prepared in (l)-(4) (Fig. 2). 

15 Thus, PCR was routinely performed using total DNA of 

IR24 as a template in combination with the following primer 
pair: 

5 ' -tggatggactatgtggggtcagtc-3 ' (SEQ ID NO: 37) and 
5 ' -agtggaagtggagagagtagggag-3 ' (SEQ ID NO: 38) 
20 designed to amplify a part of this BAC clone. The 

resulting amplification products of about 600 bp were 
purified and labeled by the method described above to give 
a library screening probe (probe H, Fig. 1). 

25 Library screening and phage DNA purification were 

performed by the method described above. 



(Results and Discussion) 

- 109 - 



The results of terminal base sequence analysis and 
restriction enzyme fragment length analysis showed that one 
(XSH18) of 15 clones obtained by screening of the IR24 
genomic library was in a relative position as shown in Fig. 
1. The IR24 genomic base sequence corresponding to XSH18 
was determined by primer walking. 

Example 3; High precision segregation analysis 
(1) Development of PCR marker P449 7 Mbol 

Comparison between the genomic base sequence 
corresponding to the IR24 contig (SEQ ID NO: 27) and the 
genomic base sequence corresponding to the Asominori contig 
(SEQ ID NO: 28) determined in Example 2 revealed that the 
1239th base of SEQ ID NO: 27 is A while the 12631st base of 
SEQ ID NO: 28 corresponding to said position is G. 

For detecting this change, fragments of about 730 bp 
are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
P4497 Mbol F: 
5 ' -ccctccaacacataaatggttgag-3 ' (SEQ ID NO: 39) 
(corresponding to bases 853-876 of SEQ ID NO: 27) 
(corresponding to bases 12247-12270 of SEQ ID NO: 28) 
and 

P4497 Mbol R: 
5 ' -tttctgccaggaaactgttagatg-3 ' (SEQ ID NO: 40) 
(corresponding to bases 1583-1560 of SEQ ID NO: 27) 
(corresponding to bases 12975-12952 of SEQ ID NO:28). 
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The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
Mbol. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
5 length of DNA after Mbol treatment because the 

amplification products from Asominori DNA having an Mbol 
recognition sequence (GATC) are cleaved with Mbol while the 
amplification products from IR24 DNA are not cleaved with 
Mbol for the lack of the Mbol recognition sequence. 

10 

(2) Development of PCR marker P9493 BslI 

Comparison between the genomic base sequence 
corresponding to the IR24 contig (SEQ ID NO: 27) and the 
genomic base sequence corresponding to the Asominori contig 
15 (SEQ ID NO: 28) determined in Example 2 revealed that the 

6227th base of SEQ ID NO: 27 is A while the 17627th base of 
SEQ ID NO: 28 corresponding to said position is C. 

For detecting this change, fragments of 126 bp are 
20 first amplified by PCR from a region surrounding said 
position using the following primer pair: 
P9493 BslI F: 
5 ' -gcgatcttatacgcatactatgcg-3 9 (SEQ ID NO: 41) 
(corresponding to bases 6129-6152 of SEQ ID NO: 27) 
25 (corresponding to bases 17529-17552 of SEQ ID NO: 28) 

and 

P9493 BslI R: 
5 ' -aaagtctttgttccttcaccaagg-3 9 (SEQ ID NO: 42) 
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(corresponding to bases 6254-6231 of SEQ ID NO:27) 
(corresponding to bases 17654-17631 of SEQ ID NO:28). 
The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
5 Bsll. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
length of DNA after Bsll treatment because the 
amplification products from Asominori DNA having a Bsll 
recognition sequence (CCNNNNNNNGG) are cleaved with Bsll 
10 while the amplification products from IR24 DNA are not 
cleaved with Bsll for the lack of the Bsll recognition 
sequence . 

This marker was developed by applying the dCAPS method 
15 (Michaels and Amasino 1998, Neff et al . , 1998). 

Specifically, g is substituted for a at the base 6236 of 
SEQ ID NO: 27 and the base 17636 of SEQ ID NO: 28 by the use 
of P9493 Bsll R primer described above. Thus, the 
fragments from Asominori DNA come to have a sequence of 
20 CCtttccttGfi at 17626-17636 of SEQ ID NO: 28 so that they are 
cleaved with Bsll . 

(3) Development of PCR marker P23945 Mbol 

Comparison between the genomic base sequence 
25 corresponding to the IR24 contig (SEQ ID NO: 27) and the 

genomic base sequence corresponding to the Asominori contig 
(SEQ ID NO: 28) determined in Example 2 revealed that the 
20680th base of SEQ ID NO: 27 is G while the 32079th base of 



SEQ ID NO: 28 corresponding to said position is A. 



For detecting this change, fragments of 260 bp are 
first amplified by PCR from a region surrounding said 
5 position using the following primer pair: 
P23945 Mbol F: 
5 ' -gaggatttatcaaaacaggatggacg-3 ' (SEQ ID NO: 43) 
(corresponding to bases 20519-20544 of SEQ ID NO: 27) 
(corresponding to bases 31918-31943 of SEQ ID NO: 28) 
10 and 

P23945 Mbol R: 
5' -tgggcggcagcagtggaggataga-3' (SEQ ID NO: 44) 
(corresponding to bases 20778-20755 of SEQ ID NO: 27) 
(corresponding to bases 32177-32154 of SEQ ID N0J28). 
15 The amplification products can be visualized by 

electrophoresis on an agarose gel after treatment with 
Mbol. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
length of DNA after Mbol treatment because the 
20 amplification products from IR24 DNA having an Mbol 

recognition sequence (GATC) are cleaved with Mbol while the 
amplification products from Asominori DNA are not cleaved 
with Mbol for the lack of the Mbol recognition sequence. 

25 (4) Development of PCR marker P41030 TaqI 

Comparison between the genomic base sequence 
corresponding to the IR24 contig (SEQ ID NO: 27) and the 
genomic base sequence corresponding to the Asominori contig 

- 113 - 



(SEQ ID NO; 28) determined in Example 2 revealed that the 
45461st base of SEQ ID NO: 27 is A while the 49164th base of 
SEQ ID NO: 28 corresponding to said position is G. 

5 For detecting this change, fragments of 28 0 bp are 

first amplified by PCR from a region surrounding said 
position using the following primer pair: 
P41030 TaqI F: 
5 ' -aagaagggagggttatagaatctg-3 ' (SEQ ID NO: 45) 
10 (corresponding to bases 45369-45392 of SEQ ID NO: 27) 

(corresponding to bases 49072-49095 of SEQ ID NO: 28) 
and 

P41030 TaqI R: 
5 ' -atatcaggactaacaccactgctc-3 ' (SEQ ID NO: 46) 

15 (corresponding to bases 45648-45625 of SEQ ID NO: 27) 

(corresponding to bases 49351-49328 of SEQ ID NO: 28). 
The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
TaqI. Thus, the change can be detected as a difference in 

20 mobility in the agarose gel due to the difference in the 
length of DNA after TaqI treatment because the 
amplification products from Asominori DNA having a TaqI 
recognition sequence (TCGA) are cleaved with TaqI while the 
amplification products from IR24 DNA are not cleaved with 

25 TaqI for the lack of the TaqI recognition sequence. 

(5) Development of PCR marker P45177 BstUI 

Comparison between the genomic base sequence 
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corresponding to the IR24 contig (SEQ ID NO: 27) and the 
genomic base sequence corresponding to the Asominori contig 
(SEQ ID NO: 28) determined in Example 2 revealed that the 
49609th base of SEQ ID NO: 27 is A while the 53311st base of 
5 SEQ ID NO: 28 corresponding to said position is G. 

For detecting this change, fragments of 812 bp are 
first amplified by PCR from a region surrounding said 
position using the following primer pair: 
10 P45177 BstUI F: 

5' -acgagtagtagcgatcttccagcg-3 ' (SEQ ID NO: 47) 
(corresponding to bases 49355-49378 of SEQ ID NO: 27) 
(corresponding to bases 53057-53080 of SEQ ID NO: 28) 
and 

15 P45177 BstUI R: 

5 ' -cagcgtgaaactaaaaacggaggc-3 ' (SEQ ID NO: 48) 
(corresponding to bases 50166-50143 of SEQ ID NO: 27) 
(corresponding to bases 53868-53845 of SEQ ID NO:28). 
The amplification products can be visualized^ by 
20 electrophoresis on an agarose gel after treatment with 

BstUI. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
length of DNA after BstUI treatment because the 
amplification products from IR24 DNA having a BstUI 
25 recognition sequence (CGCG) at two positions are cleaved 
into 3 fragments with BstUI while the amplification 
products from Asominori DNA having the BstUI recognition 
sequence at three positions are cleaved with BstUI into 
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four fragments. 

(6) Development of PCR marker B60304 Mspl 

Comparison between the genomic base sequence 
5 corresponding to the IR24 contig (SEQ ID NO: 27) determined 
in Example 2 and the base sequence of the BAC clone 
described above (Accession No. AC068923) revealed that the 
56368th base of SEQ ID NO: 27 is T while the base of 
AC068923 corresponding to said position is C. 

10 

For detecting this change, fragments of about 330 bp 
are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
B60304 Mspl F: 
15 5 ' -atcccacatcatcataatccgacc-3 9 (SEQ ID NO: 49) 

(corresponding to bases 56149-56172 of SEQ ID NO: 27) 
and 

B60304 Mspl R: 
5 ' -agcttctcccttggatacggtggcg-3 ' (SEQ ID NO: 50) 

20 (corresponding to bases 56479-56455 of SEQ ID NO:27). 

The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
Mspl. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 

2 5 length of DNA after Mspl treatment because the 

amplification products from Nipponbare DNA having an Mspl 
recognition sequence (CCGG) are cleaved with Mspl while the 
amplification products from IR24 DNA are not cleaved with 
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Mspl for the lack of the Mspl recognition sequence. 

This marker was developed by applying the dCAPS 
method. Specifically, t is substituted for g at base 56463 
5 of SEQ ID NO: 27 by the use of B60304 Mspl R primer. As a 
result, the Mspl recognition sequence of bases 56460-56463 
of SEQ ID NO: 27 changes from CCGfi into ccgt so that the 
fragments from SEQ ID NO: 27 become unable to be cleaved 
with Mspl. Thus, the fragments from IR24 have no Mspl 
10 recognition sequence, while DNA from Nipponbare has the 
Mspl recognition sequence at one position in a region 
corresponding to bases 56367-56370 of SEQ ID NO: 27. 

(7) Development of PCR marker B59066 BsaJI 
15 Comparison between the genomic base sequence 

corresponding to the IR24 contig (SEQ ID NO: 27) determined 
in Example 2 and the base sequence of the BAC clone 
described above (Accession No, AC068923) revealed that the 
57629th base of SEQ ID NO: 27 is C while the base of 
20 AC068923 corresponding to said position is CC. 

For detecting this change, fragments of about 420 bp 
are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
25 B59066 BsaJI F: 

5' -atttgttggttagttgcggctgag-3' (SEQ ID NO: 51) 
(corresponding to bases 57563-57586 of SEQ ID NO: 27) 
and 
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B59066 BsaJI R: 
5 ' -gcccaaactcaaaaggagagaacc-3 ' (SEQ ID NO: 52) 
(corresponding to bases 57983-57960 of SEQ ID NO:27). 
The amplification products can be visualized by 
5 electrophoresis on an agarose gel after treatment with 

BsaJI. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
length of DNA after BsaJI treatment because the 
amplification products from Nipponbare DNA having a BsaJI 
10 recognition sequence (CCNNGG) are cleaved with BsaJI while 
the amplification products from IR24 DNA are not cleaved 
with BsaJI for the lack of the BsaJI recognition sequence. 

(8) Development of PCR marker B56691 Xbal 
15 Comparison between the genomic base sequence 

corresponding to the IR24 contig (SEQ ID NO: 27) determined 
in Example 2 and the base sequence of the BAC clone 
described above (Accession No. AC068923) revealed that the 
66267th base of SEQ ID NO: 27 is G while the base of 
20 AC068923 corresponding to said position is C. 



For detecting this change, fragments of about 670 bp 
are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
25 B56691 Xbal F: 

5 ' -cctcaagtctcccctaaagccact-3 ' (SEQ ID NO: 53) 
(corresponding to bases 66129-66152 of SEQ ID NO:27) 
and 
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B56691 Xbal R: 
5 ' -gctctactgctgataaaccgtgag-3 ' (SEQ ID NO: 54) 
(corresponding to bases 66799-66776 of SEQ ID NO:27). 
The amplification products can be visualized by 
5 electrophoresis on an agarose gel after treatment with 

Xbal. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
length of DNA after Xbal treatment because the 
amplification products from Nipponbare DNA having an Xbal 
10 recognition sequence (TCTAGA) are cleaved with Xbal while 
the amplification products from IR24 DNA are not cleaved 
with Xbal for the lack of the Xbal recognition sequence. 

(9) Development of PCR marker B53627 BstZ17I 
15 Comparison between the genomic base sequence 

corresponding to the IR24 contig (SEQ ID NO: 27) determined 
in Example 2 and the base sequence of the BAC clone 
described above (Accession No. AC068923) revealed that the 
69331st base of SEQ ID NO: 27 is T while the base of 
20 AC068923 corresponding to said position is C. 

For detecting this change, fragments of about 620 bp 
are first amplified by PCR from a region surrounding said 
position using the following primer pair: 
25 B53627 BstZ17I F: 

5 ' -tggatggactatgtggggtcagtc-3 ' (SEQ ID NO: 55) 
(corresponding to bases 68965-68988 of SEQ ID NO: 27) 
and 
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B53627 BstZ17I R: 
5' -agtggaagtggagagagtagggag-3 ' (SEQ ID NO: 56) 
(corresponding to bases 69582-69559 of SEQ ID NO:27). 
The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
BstZ17I. Thus, the change can be detected as a difference 
in mobility in the agarose gel due to the difference in the 
length of DNA after BstZ17I treatment because the 
amplification products from IR24 DNA having a BstZ17I 
recognition sequence ( GTATAC ) are cleaved with BstZ17I 
while the amplification products from Nipponbare DNA are 
not cleaved with BstZ17I for the lack of the BstZ17I 
recognition sequence . 

(10) Development of PCR marker B40936 Msel 

Development of all the following PCR markers (10) -(12) 
relates to a study of the base sequences corresponding to 
further downstream regions (3') of base 76363 at the 3' end 
of SEQ ID NO: 27. 

The following primer pair was designed for the base 
sequence of the BAC clone described above (Accession No. 
AC068923) : 

5' -tacgacgccatttcactccattgc-3 ' (SEQ ID NO: 57) 
and 

5' -catttctctatgggcgttgctctg-3' (SEQ ID NO: 58). 
PCR was routinely performed using this primer pair in 
combination with total DNAs of MS-FR Koshihikari (genotype 
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of the Rf-1 locus: Rf-1 Rf-1) and Koshihikari as templates. 
The resulting amplification products of about 1300 bp were 
electrophoresed on an agarose gel and then purified by 
QIAEXII (QIAGEN) . Analysis of the base sequence of the 
purified DNA by a DNA sequencer 377 ( ABI ) showed several 
polymorphisms . 

One of them can be detected by PCR amplification of a 
region surrounding said position using the following primer 
pair: 

B40936 Msel F: 
5 ' -acctgtaggtatggcaccttcaacac-3 ' (SEQ ID NO: 59) 

and 

B40936 Msel R: 

5 ' -ccaaggaacgaagttcaaatgtatgg-3 ' (SEQ ID NO:60). 
The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
Msel. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
length of DNA after Msel treatment because the 
amplification products from MS-FR Koshihikari (Rf-1 Rf-1) 
DNA having an Msel recognition sequence (TTAA) are cleaved 
with Msel while the amplification products from Koshihikari 
DNA are not cleaved with Msel for the lack of the Msel 
recognition sequence . 

This marker was developed by applying the dCAPS 
method . 
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(11) Development of PCR marker B19839 Mwol 

The following primer pair was designed for the base 
sequence of the BAC clone described above (Accession No. 
AC068923) : 

5 5' -tgatgtgtttgggcatccctttcg-3' (SEQ ID NO: 61) 
and 

5 ' -gagataggggacgacagacacgac-3 ' (SEQ ID NO: 62). 
PCR was routinely performed using this primer pair in 
combination with total DNAs of MS-FR Koshihikari (genotype 

10 of the Rf-1 locus: Rf-1 Rf-1) and Koshihikari as templates. 
The resulting amplification products of about 1200 bp were 
electrophoresed on an agarose gel and then purified by 
QIAEXII (QIAGEN) . Analysis of the base sequence of the 
purified DNA by a DNA sequencer 377 (ABI) showed several 

15 polymorphisms. 



One of them can be detected by PCR amplification of a 
region surrounding said position using the following primer 
pair: 
20 B19839 Mwol F: 

5 ' -tcctatggctgtttagaaactgcaca-3 ' (SEQ ID NO: 63) 
and 

B19839 Mwol R: 
5 ' -caagttcaaacataactggcgttg-3 ' (SEQ ID NO:64). 
25 The amplification products can be visualized by 

electrophoresis on an agarose gel after treatment with 
Mwol. Thus, the change can be detected as a difference in 
mobility in the agarose gel due to the difference in the 
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length of DNA after Mwol treatment because the 
amplification products from Koshihikari DNA having an Mwol 
recognition sequence ( GCNNNNNNNGC ) are cleaved with Mwol 
while the amplification products from MS-FR Koshihikari 
5 (Rf-1 Rf-1) DNA are not cleaved with Mwol for the lack of 
the Mwol recognition sequence. 

This marker was developed by applying the dCAPS 
method. 

10 

(12) Development of PCR marker B2387 Bfal 

The following primer pair was designed for the base 
sequence of the BAC clone described above (Accession No. 
AC068923): 

15 5 ' -cactgtcctgtaagtgtgctgtgc-3 ' (SEQ ID NO:65) 
and 

5 ' -caagcgtgtgataaaatgtgacgc-3 ' (SEQ ID NO: 66). 
PCR was routinely performed using this primer pair in 
combination with total DNAs of MS-FR Koshihikari (genotype 

20 of the Rf-1 locus: Rf-1 Rf-1) and Koshihikari as templates. 
The resulting amplification products of about 1300 bp were 
electrophoresed on an agarose gel and then purified by 
QIAEXII (QIAGEN) . Analysis of the base sequence of the 
purified DNA by a DNA sequencer 377 (ABI) showed several 

25 polymorphisms. 

One of them can be detected by PCR amplification of a 
region surrounding said position using the following primer 
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pair: 

B2387 Bfal F: 
5 ' -tgcctactgccattactatgtgac-3 ' (SEQ ID NO: 67) 
and 

5 B2387 Bfal R: 

5 ' -acatactaccgtaaatggtctctg-3 ' (SEQ ID NO: 68). 
The amplification products can be visualized by 
electrophoresis on an agarose gel after treatment with 
Bfal. Thus, the change can be detected as a difference in 

10 mobility in the agarose gel due to the difference in the 
length of DNA after Bfal treatment because the 
amplification products from Koshihikari DNA having an Bfal 
recognition sequence (CTAG) are cleaved with Bfal while the 
amplification products from MS-FR Koshihikari (Rf-1 Rf-1) 

15 DNA are not cleaved with Bfal for the lack of the Bfal 
recognition sequence. 

(13) Segregation analysis 

Two recombinants between the Rf-1 and S12564 Tsp509I 

20 loci (RSI and RS2) and 8 recombinants between the Rf-1 and 
C1361 Mwol loci (RC1 to RC8) obtained in Example 1 were 
genotyped at the 12 DNA marker loci developed in (1) to 
(12) above. The results are shown in Table 4 along with 
the genotypes of each recombinant at the S12564 Tsp509I and 

25 C1361 Mwol loci. 
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V 



Table 4. Genotypes of recombinants proximal to the Rf-1 
locus at various marker loci 



Locus ' 


RSI 


RS2 


RC1 


RC2 


RC3 


RC4 


RC5 


RC6 


RC7 


RC8 


S 1 2564 Tsp509l 


J 


J 


H 


H 


H 


H 


H 


H 


H 


H 


P4497 Mbol 


J 


J 


H 


H 


H 


H 


H 


H 


H 


H 


P9493 Bslt . 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


P23945 Mbol 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


P41030 Taql. 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


P45177 BstUI 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


B60304 Mspl 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


B59066 BsaJI- 


H 


H 


H 


H 


H 


H 


H 


H 


H 


H 


B56691 Xbal 


H 


H 


H 


H 


H 


H 


H 




H 


H 


B53627 BstZUI 


H 


H 


H 


H 


H 


H 


H 




H 


H 


B40936 Msel 


H 


H 


H 


H 


H 


H 


H 




H 


H 


B 19839 Mwol 


H 


H 


H 


H 


H 


J 


H 




H 


H 


B2387 Bfal 


H 


H 


H 


H 


H 


J 


H 




H 


J 


C.1361 Mwol 


H 


H 


J 


J 


J 


J 


J 




J 


J 



J: Homozygous for Koshihikari 

H: Heterozygous for Koshihikari /MS-FR Koshihikari 

5 

Table 4 shows that all the recombinants have an 
indica-derived Rf-1 chromosomal region between P9493 BslI 
and 59066 BsaJI . This result showed that recombinant 
pollens having the chromosomal organization as shown in 

10 Fig. 3 have pollen fertility, i.e. the Rf-1 gene is 

functional in these pollens. This means that a sequence 
determining the presence of the function of the Rf-1 gene 
is included in the indica region common to these 
recombinant pollens, i.e. in a region from the P4497 Mbol 

15 to B56691 Xbal loci (about 65 kb) as estimated at maximum. 



However, there is a possibility that it is important 
for the expression of the genetic function of the Rf-1 gene 
that the Rf-1 gene is partially of the indica genotype, and 
20 that the genetic function may not be significantly changed 
whether the remaining regions are of the japonica or indica 
genotype. Therefore, it cannot be concluded that the 
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common indica region above (bases 1239-66267 of SEQ ID 
NO: 27) completely contains the entire Rf-1 gene. However, 
it is thought that at least SEQ ID NO: 27 completely 
contains the entire Rf-1 gene for the following reasons: 
5 1) the size of a gene is normally several kilobases, 

and rarely exceeds 10 kb; 

2) the genomic base sequence of IR2 4 determined by the 
present invention (SEQ ID NO: 27) completely contains the 
common indica region above; 
10 3) the 5 r end of SEQ ID NO: 27 is located 1238 bp 

upstream of the 5' end of the common indica region above 
and forms a part of another gene (S12564); and 

4) the 3' end of SEQ ID NO: 27 is located 10096 bp 
downstream of the 3' end of the common indica region above . 

15 

Exam ple 4= complementation assa y for a 9.7 Kb fragment from 

XSE1 

(Materials and Methods) 

The K phage clone XSE1 (Figs. 1 and 5) was completely 
20 digested with NotI and electrophoresed on an agarose gel. 
The separated 9.7 kb fragment (including bases 1-9657 of 
SEQ ID NO:27) was purified by QIAEXII (QIAGEN) . 

On the other hand, an intermediate vector pSB200 
25 having a hygromycin-resistant gene cassette was prepared on 
the basis of pSBll (Komari et al. , supra.). Specifically, 
a nopaline synthase terminator (Tnos) was first fused to a 
ubiquitin promoter and a ubiquitin intron (Pubi-ubil). A 
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hygromycin- resistant gene (HYG(R)) was inserted between 
ubil and Tnos of the resulting Pubi-ubil-Tnos complex to 
give an assembly of Pubi-ubil-HYG(R) -Tnos . This assembly 
was fused to a Hindlll/EcoRI fragment of pSBll to give 
pKY205. Linker sites for adding restriction enzyme sites 
Notl. NspV, EcoRV, Kpnl . Sad . EcoRI were inserted into the 
Hind III site upstream of Pubi of this pKY205 to give 
pSB200 having a hygromycin-resistant gene cassette. 

After the plasmid vector pSB200 was completely 
digested with Notl, DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in TE 
solution and then dephosphorylated by CIAP (TAKARA) . The 
reaction solution was electrophoresed on an agarose gel. 
and then a vector fragment was purified from the gel using 
QIAEXII (QIAGEN) . 

The two fragments prepared above, i.e. a 9.7 kb 
fragment from XSE1 and a vector fragment were subjected to 
a ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). 
After the reaction, DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in pure 
water (prepared by a Millipore system) and then mixed with 
E. coli DH5a cells, and the mixture was electroporated. 
After electroporation. the solution was cultured with 
shaking in LB medium (37°C. 1 hr) and then plated on an LB 
plate containing spectinomycin and warmed (37°C, 16 hr) . 
Plasmids were isolated from 24 of the resulting colonies. 
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Their restriction enzyme fragment length patterns and 
boundary base sequences were analyzed to select desired E. 
coli cells transformed with recombinant plasmids . 

5 The E. coli cells selected above were used for 

triparental mating with the Agrobacterium tumefaciens 
strain LBA4404/pSBl (Komari et al. f 1996) and the helper E. 
coli strain HB101/pRK2013 (Ditta et al . , 1980) according to 
the method of Ditta et al . (1980). Plasmids were isolated 
10 from 6 of the colonies formed on an AB plate containing 

spectinomycin and their restriction enzyme fragment length 
patterns were analyzed to select desired Agrobacterium 
cells . 

15 The Agrobacterium cells selected above were used to 

transform MS Koshihikari (having BT cytoplasm and a nucleus 
gene substantially identical to Koshihikari) according to 
the method of Hiei et al. (1994). Necessary immature seeds 
of MS Koshihikari for transformation can be prepared by 

20 pollinating MS Koshihikari with Koshihikari. 

Transformed plants were transferred to a greenhouse 
under long-day conditions after acclimation. 48 
individuals grown to a stage suitable for transplantation 
25 were transplanted into l/5000a Wagner pots (4 

individuals /pot ) , and transferred into a greenhouse under 
short-day conditions 3-4 weeks after transplantation. 
About one month after heading, seed fertility was tested on 



standing plants. 



(Results and Discussion) 

All of the 48 transformed individuals were sterile. 
This indicates that the 9.7 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Exam ple 5: Compl e mentation assay for a 14,7 Kb fragment 
from XSE7 

(Materials and Methods) 

The X phage clone XSE7 (Figs. 1 and 5) was completely 
digested with EcoRI and then DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in TE 
solution and then blunted by DNA Blunting Kit (TAKARA) . 
The reaction solution was electrophoresed on an agarose gel 
to separate a 14.7 kb fragment (including bases 2618-17261 
of SEQ ID NO:27), which was purified by QIAEXII (QIAGEN) . 

On the other hand, the plasmid vector pSB200 was 
completely digested with Sad and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA) and 
DNA was recovered by ethanol precipitation. The recovered 
DNA was dissolved in TE solution and then blunted by DNA 
Blunting Kit (TAKARA). The reaction solution was 
electrophoresed on an agarose gel, and then a vector 
fragment was purified from the gel using QIAEXII (QIAGEN). 



The two fragments prepared above, i.e. the 14.7 kb 
fragment from XSE7 and the vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA) . Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

All of the 48 transformed individuals were sterile. 
This indicates that the 14.7 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 6: Complementation assay for a 21.3 kb fragment 
f ro m X S F4 

(Materials and Methods) 

The X phage clone XSF4 (Figs. 1 and 5) was partially 
digested with NotI and electrophoresed on an agarose gel. 
The separated 21.3 kb fragment (including bases 12478-33750 
of SEQ ID NO:27) was purified by QIAEXII (QIAGEN) . 

On the other hand, the plasmid vector pSB200 was 
completely digested with Noti and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA). 
The reaction solution was electrophoresed on an agarose 
gel , and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN). 

The two fragments prepared above, i.e. the 21.3 kb 
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fragment from XSF4 and the vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA) . Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

All of the 48 transformed individuals were sterile. 
This indicates that the 21.3 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Exam ple 7 ; Complementation a ssay for a 13.2 Kb fragment 
from XSF20 

(Materials and Methods) 

The X phage clone XSF20 (Figs. 1 and 5) was completely 
digested with Sail and then DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in TE 
solution and then blunted by DNA Blunting Kit (TAKARA). 
The reaction solution was electrophoresed on an agarose gel 
to separate a 13.2 kb fragment (including bases 26809-40055 
of SEQ ID NO:27), which was purified by QIAEXII (QIAGEN) . 

On the other hand, the plasmid vector pSB200 was 
completely digested with EcoRV and then DNA was recovered 
by ethanol precipitation. The recovered DNA was dissolved 
in TE solution and then dephosphorylated by CIAP (TAKARA). 
The reaction solution was electrophoresed on an agarose 
gel, and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN). 
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The two fragments prepared above, i.e. the 13.2 kb 
fragment from XSF20 and the vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA) . Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

All of the 44 transformed individuals were sterile. 
This indicates that the 13.2 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Exam ple 8; Complementation as say for a 16,2 K b fragment 
from XSF18 

(Materials and Methods) 

The X phage clone XSF18 is identical to XSF20 at the 
5' and 3' ends (bases 20328 and 41921 of SEQ ID NO: 27, 
respectively), but lacks internal bases 33947-38591. Thus, 
it comprises bases 20328-33946 and 38592-41921 of SEQ ID 
NO: 27. This is because clone XSF18 was initially isolated 
but found to contain the above deletion during 
amplification after isolation, and therefore, the 
amplification step was freshly taken to isolate a complete 
clone designated XSF20. 

The X phage clone XSF18 (Fig. 5) was completely 
digested with NotI and electrophoresed on an agarose gel. 
The separated 16.2 kb fragment (including bases 21065-33946 
and 38592-41921 of SEQ ID NO: 27) was purified by QIAEXII 
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(QIAGEN) . 



On the other hand, the plasmid vector pSB200 was 
completely digested with NotI and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA) . 
The reaction solution was electrophoresed on an agarose 
gel, and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN). 

The two fragments prepared above, i.e. the 16.2 kb 
fragment from XSF18 and the vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA). Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

All of the 48 transformed individuals were sterile 
(Fig. 6). This indicates that the 16.2 kb insert fragment 
does not contain at least the full-length Rf-1 gene. 

Example 9: Complementation assay for a 12.6 kb fragme nt 
from XSG22 

(Materials and Methods) 

The X phage clone XSG22 (Figs. 1 and 5) was partially 
digested with NotI and electrophoresed on an agarose gel. 
The separated 12.6 kb fragment (including bases 31684-44109 
of SEQ ID NO:27) was purified by QIAEXII (QIAGEN). 
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On the other hand, the plasmid vector pSB200 was 
completely digested with NotI and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA). 
The reaction solution was electrophoresed on an agarose 
gel, and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN) . 

The two fragments prepared above, i.e. the 12.6 kb 
fragment from XSG22 and the vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA). Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 

All of the 48 transformed individuals were sterile. 
This indicates that the 12.6 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 10; (11 Complementation assay for a 15.7 kb 

fragment from 
(Materials and Methods) 

The X phage clone XSG16 (Figs. 1 and 5) was partially 
digested with NotI and electrophoresed on an agarose gel. 
The separated 15.7 kb fragment (including bases 38538-54123 
of SEQ ID NO: 27) was purified by QIAEXII (QIAGEN). 



On the other hand, the plasmid vector pSB200 was 
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completely digested with Not I and then DNA was recovered by 
ethanol precipitation . The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA) . 
The reaction solution was electrophoresed on an agarose 
5 gel, and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN) . 

The two fragments prepared above, i.e. the 15.7 kb 
fragment from XSG16 and the vector fragment were subjected 
10 to a ligation reaction using DNA Ligation Kit Ver. 1 

(TAKARA). Subsequently, transformed plants were prepared 
and studied according to the method described in Example 4. 

(Results and Discussion) 
15 Of the 47 transformed individuals, at least 37 

individuals clearly restored fertility (Fig. 6). This 
indicates that 15586 bases (bases 38538-54123 of SEQ ID 
NO: 27) derived from rice (IR24) in the 15.7 kb insert 
fragment include the full-length Rf-1 gene. 

20 

(2) Complementation assay for an internal 11.4 kb fragment 
in XSG16 

(Materials and Methods) 

The X phage clone XSG16 was completely digested with 
25 AlwNI and BsiWI and then DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in TE 
solution and then blunted by DNA Blunting Kit (TAKARA) . 
The reaction solution was electrophoresed on an agarose gel 
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to separate a 11.4 kb fragment, which was purified by 
QIAEXII (QIAGEN) . 

The plasmid vector pSBll (Komari et al. Plant Journal, 
1996) was completely digested with Smal and then DNA was 
recovered by ethanol precipitation. The recovered DNA was 
dissolved in TE solution and then dephosphorylated by CIAP 
(TAKARA) . The reaction solution was electrophoresed on an 
agarose gel, and then a vector fragment was purified from 
the gel using QIAEXII (QIAGEN). 

The two fragments prepared above were subjected to a 
ligation reaction using DNA Ligation Kit Ver. 1 (TAKARA). 
After the reaction, DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in pure 
water (prepared by a Millipore system) and then mixed with 
E. coli DH5a cells, and the mixture was electroporated. 
After electroporation, the solution was cultured with 
shaking in LB medium (37°C, 1 hr) and then plated on an LB 
plate containing spectinomycin and warmed (37°C, 16 hr) . 
Plasmids were isolated from 14 of the resulting colonies, 
and their restriction enzyme fragment length patterns and 
boundary base sequences were analyzed to select desired E. 
coli cells . 

The E. coli cells selected above were used for 
triparental mating with the Agrobacterium tumefaciens 
strain LBA4404/pSB4U (Takakura et al . , Japanese Patent 



Application No. 2001-269982 (WO02/019803 Al)) and the 
helper E. coli strain HB101/pRK2013 (Ditta et al . , 1980) 
according to the method of Ditta et al. (1980). Plasmids 
were isolated from 12 of the colonies formed on an AB plate 
containing spectinomycin and their restriction enzyme 
fragment length patterns were analyzed to select desired 
Agrobacterium cells . 

The Agrobacterium cells selected above were used to 
transform MS Koshihikari (having BT cytoplasm and a nucleus 
gene substantially identical to Koshihikari) according to 
the method of Hiei et al. (1994). Necessary immature seeds 
of MS Koshihikari for transformation can be prepared by 
pollinating MS Koshihikari with Koshihikari. 

Transformed plants were transferred to a greenhouse 
under long-day conditions after acclimation. 120 
individuals grown to a stage suitable for transplantation 
were transplanted into l/5000a Wagner pots (4 
individuals /pot ) , and transferred into a greenhouse under 
short-day conditions about one month after transplantation. 
About one month after heading, one typical ear was sampled 
from each plant to evaluate seed fertility (the percentage 
of fertile paddies to total paddies). 

(Results and Discussion) 

Of the 120 transformed individuals, 59 individuals 
showed seed fertility of 10% or more, among which 19 
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individuals showed seed fertility of 70% or more. This 
indicates that the 11.4 kb insert fragment (bases 42357- 
53743 of SEQ ID NO: 27) contains an essential Rf-1 gene 
region for expressing a fertility restoring function. 

(3) Complementation assay for an internal 6 . 8 kb fragment 
in XSG16 

(Materials and Methods) 

The X phage clone XSG16 was completely digested with 
Hpal and AlwNI and electrophoresed on an agarose gel. The 
separated 6 . 8 kb fragment was purified by QIAEXII (QIAGEN) . 

The subsequent procedures including the preparation of 
the plasmid vector pSBll were performed according to the 
method in (2) above. 

(Results and Discussion) 

Of the 120 transformed individuals, 67 individuals 
showed seed fertility of 10% or more, among which 26 
individuals showed seed fertility of 70% or more. This 
indicates that the 6 . 8 kb insert fragment (bases 42132- 
48883 of SEQ ID NO: 27) contains an essential Rf-1 gene 
region for expressing a fertility restoring function. 

Example 11: Complementation assay for a 16.9 kb fragment 
from XSG8 

(Materials and Methods) 

The X phage clone XSG8 (Figs. 1 and 5) was completely 
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digested with. Not I and electrophoresed on an agarose gel. 
The separated 16.9 kb fragment (including bases 46558-63364 
of SEQ ID NO:27) was purified by QIAEXII (QIAGEN) . 

On the other hand, the plasmid vector pSB200 was 
completely digested with NotI and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA) . 
The reaction solution was electrophoresed on an agarose 
gel, and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN). 

The two fragments prepared above, i.e. the 16.9 kb 
fragment from XSG8 and the vector fragment were subjected 
to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA). Subsequently, transformed individuals were 
prepared and studied according to the method described in 
Example 4 . 

(Results and Discussion) 

All of the 48 transformed individuals were sterile. 
This indicates that the 16.9 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 12s Complementation assay for a 20.0 kb fragment 
from XSH18 

(Materials and Methods) 

The X phage clone XSH18 (Figs. 1 and 5) was completely 
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digested with NotI and electrophoresed on an agarose gel. 
The separated 20.0 kb fragment (including bases 56409-76363 
of SEQ ID NO:27) was purified by QIAEXII (QIAGEN) . 

5 On the other hand, the plasmid vector pSB200 was 

completely digested with NotI and then DNA was recovered by 
ethanol precipitation. The recovered DNA was dissolved in 
TE solution and then dephosphorylated by CIAP (TAKARA) . 
The reaction solution was electrophoresed on an agarose 
10 gel, and then a vector fragment was purified from the gel 
using QIAEXII (QIAGEN). 

The two fragments prepared above, i.e. the 20.0 kb 
fragment from XSH18 and the vector fragment were subjected 
15 to a ligation reaction using DNA Ligation Kit Ver. 1 
(TAKARA) . Subsequently, transformed individuals were 
prepared and studied according to the method described in 
Example 4 . 



20 (Results and Discussion) 

All of the 44 transformed individuals were sterile. 
This indicates that the 20.0 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

25 Example 13: Comp lementation assay for a 19.7 kb fragment 
from an overlapping regi on of XSG8 and XSH18 
(Materials and Methods) 

A plasmid (XSG8SB200F) isolated from desired E. coli 
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cells obtained by ligation in Example 11 was completely 
digested with Sail and StuI and electrophoresed on an 
agarose gel. The separated 12.8 kb fragment (including 
bases 50430-63197 of SEQ ID NO: 27) was purified by QIAEXII 
5 (QIAGEN). 

\ 

On the other hand, a plasmid (XSH18SB200R) isolated 
from desired E. coli cells obtained by ligation in Example 
12 was completely digested with Sail, StuI and Xhol and 
10 electrophoresed on an agarose gel to separate a 6.9 kb 
fragment (including bases 63194-70116 of SEQ ID NO: 27), 
which was purified by QIAEXII (QIAGEN). 

Further, the plasmid vector pSB200 was completely 
15 digested with EcoRV and then DNA was recovered by ethanol 
precipitation. The recovered DNA was dissolved in TE 
solution and then dephosphorylated by CIAP (TAKARA) . The 
reaction solution was electrophoresed on an agarose gel, 
and then a vector fragment was purified from the gel using 
20 QIAEXII (QIAGEN). 

The three fragments prepared above, i.e. the 12.8 kb 
fragment from XSG8, the 6.9 kb fragment from XSH18 and the 
vector fragment were subjected to a ligation reaction using 
25 DNA Ligation Kit Ver. 1 (TAKARA). The ligation product 

contains a 19.7 kb fragment from an overlapping region of 
XSG8 and XSH18 (including 50430-70116 of SEQ ID NO:27) 
(XSX1 in Fig. 5). Subsequently, transformed individuals 
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were prepared and studied according to the method described 
in Example 4 . 

(Results and Discussion) 
5 All of the 40 transformed individuals were sterile. 

This indicates that the 19.7 kb insert fragment does not 
contain at least the full-length Rf-1 gene. 

Example 14: Preparation of cPNA library 

10 Firstly, IL216, a line wherein the Rf-1 is introduced 

into Koshihikari via backcrossing (the genotype, Rf-l/Rf- 
1), was prepared. The IL216 was grown in a greenhouse by a 
conventional method, and young panicles were sampled during 
the growth stage wherein the length between auricles is -5 

15 - 5 cm. Total RNA was extracted by the SDS-phenol method 
(Watanabe, A. and Price, C.A. (1982) Translation of mRNAs 
for subunits of chloroplast coupling factor 1 in spinach. 
Proceedings of the National Academy of Sciences of the 
U.S.A., 79, 6304-6308), and the poly (A) + RNA was purified 

20 using QuickPrep mRNA Purification Kit(Amersham Pharmacia 
Biotech) . 

The purified poly (A)+ RNA was provided to prepare a 
cDNA library by ZAP-cDNA Synthesis Kit ( Stratagene ) . The 
2 5 titer of the prepared library (1ml) was calculated to be 
16,000,000 pfu/ml, and was determined to be sufficiently 
large . 
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Example 15; Screening of the cDNA library 

(1) Preparation of the screening primers 

PCR was performed by using the following two types of 

primes : 

5 Sense primer 

5 ' -tctcattctctccacgccctgctc-3 ' ( SEQ ID NO: 76) 

Antisense primer 

5 ' -acggcggagcaattcgtcgaacac-3 ' (SEQ ID NO: 77) 

and XSG16, a genomic clone of IR24, as a template. SEQ ID 
10 NOS:76 and 77 correspond to the bases 43733-43756 and the 
bases 44038-44015 of SEQ ID NO: 27, respectively. 

After the electrophoresis, the amplification product 
of about 300 bp was recovered from the agarose gel by QIAEX 
15 II Gel Extraction Kit (QIAGEN) . The recovered fragment was 
32 P-labeld by Rediprime II DNA labelling system (Amersham 
Pharmacia Biotech) (The fragment is hereunder referred to 
as "Probe P"). 

20 Further, PCR was performed by using the following two 

types of primes : 

Sense primer 
5 ' -agtgtgtggcatggtgcatttccg-3 ' (SEQ ID NO: 78) 

Antisense primer 

25 5 ' -ctctacaggatacacggtgtaagg-3 ' (SEQ ID NO: 79) 

and XSG16 , a genomic clone of IR24, as a template. SEQ ID 
NOS:78 and 79 correspond to the bases 48306-48329 and the 
bases 50226-50203 of SEQ ID NO: 27, respectively. After the 



electrophoresis, the amplification product of about 1900 bp 
was recovered from the agarose gel. The recovered fragment 
was 32 P-labeld by the method mentioned above (The fragment 
is hereunder refers to as "Probe Q" ) . 

5 

(2) Screening of the cDNA libbary 

The cDNA library prepared in Example 14 was provided 
to prepare 70 of agar medium wherein about 15000 plaques 
appeared. Plaque lift was performed twice for each agar 
10 medium, and the plaques were transferred to Hybond-N* 

(Amersham Pharmacia Biotech) . One membrane was used for 
hybridization with Probe P, and the other membrane was used 
for hybridization with Probe Q. The whole steps were 
performed according to the manufacture's instructions. 

15 

Probes were added to a hybridization solution 
containing 250M Na 2 HP0 4 , ImM EDTA and 7% SDS, and 
hybridization was performed at 65°C for 16 hours. Washing 
was performed twice with a solution containing 1 X SSC and 
20 0.1% SDS, at 65°C for 15 minutes, and then twice with a 

solution containing 0.1 X SSC and 0.1% SDS, at 65°C for 15 
minutes. After the washing, the membranes were analyzed 
with FUJIX BAS 1000 (Fuji Photo Films). 

25 As a result, 8 plaques which showed positive for both 

Probe P and Probe Q were identified. Therefore, those 
plaques were isolated, subcloned into pBluescript according 
to the instructions of the manufacture ( Stratagene) . Among 
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8 clones, the terminal base sequences of 6 clones were 
identical to that of XSG16 . The entire base sequences of 
the 6 clones were determined, and the results are shown in 
SEQ ID NOS: 69-74 in the sequence listing. 

5 

All of the sequences, SEQ ID NOS: 69-74 are presumed to 
encode a protein having the amino acids 1-791 of SEQ ID 
NO: 75. Specifically, all and each of the 215-2587 of SEQ 
ID NO: 69, the bases 213-2585 of SEQ ID NO: 70, the bases 
10 218-2590 of SEQ ID NO:71, the bases 208-2580 of SEQ ID 
NO: 72, the bases 149-2521 of SEQ ID NO: 73 and the bases 
225-2597 of SEQ ID NO: 74 encodes a protein having amino 
acids 1-791 of SEQ ID NO: 75. The above base sequences 
correspond to the bases 43907-46279 of SEQ ID NO: 27. 

15 

The amino acid sequence of SEQ ID NO: 75 was compared 
with the presumed amino acid sequence of the corn fertility 
restorer gene (Rf2), and the N-terminal 7 amino acid 
residues (Met -Ala-Arg-Arg-Ala-Ala-Ser ) in both amino acid 

20 sequences were concurred. These 7 amino acid residues are 
considered to be a portion of a targeting signal to 
mitochondria (Liu et al., 2001). Based on the above facts, 
the cDNAs isolated on this occasion are considered to 
contain the full coding region of the Rf-1 gene. No 

2 5 homology between the amino acid sequences of the rice Rf-1 
and the corn Rf2 can be found except for the above region. 
It is presumed that the mechanisms by which the gene 
products of the Rf-1 and the Rf2 can restore fertility 
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after being transferred to mitochondria are distinct from 
each other. 

In addition, the sequences of cDNAs isolated on this 
5 occasion were compared with the genome sequence of IR2 4 

(SEQ ID NO:27) # and the structures of exons and introns of 
the Rf-1 gene were clarified (Fig. 7). As a result, it was 
shown that various transcription products wherein the 
splicing forms and the poly A addition positions are 
10 different, are present in a plant body. There is no intron 
in the coding region of the Rf-1 gene. 

Example 16: Complementatio n assay 

A complementation assay was performed by using a 4.2 
15 kb fragment containing the promoter region and the presumed 
translation region of the Rf-1 gene. The 4.2 kb fragment 
is in a plasmid containing the 6 . 8 kb genome derive from 
IR24 which proved to have fertility restorer function in 
Example 10(3) . 

20 

Firstly, the plasmid described in Example 10(3) was 
treated with EcoRI , and was subjected to electrophoresis 
with agarose gel. The 4 . 2 kb fragment containing the 
promoter region and the presumed translation region of the 
25 Rf-1 (corresponding to the bases 42132-46318 in SEQ ID 

NO: 27) was separated, recovered from the gal using QIAEXII 
(QIAGEN) . The 4 . 2 kb fragment was subjected to ligation 
reaction using DNA Ligation Kit Ver.l (TAKARA) together 
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with pBluescript II SK (-) which has been treated with 
EcoRI and then with CIAP (TAKARA) . After the reaction, the 
DNA was recovered by ethanol precipitation. 

5 The recovered DNA was dissolved in pure water 

(prepared by a Millipore system) and then mixed with E. 
coli DH5a cells, and the mixture was electroporated . After 
electroporation, the solution was cultured by shaking in LB 
medium (37°C, 1 hr) and then plated on an LB plate 

10 containing ampicillin and warmed (37°C, 16 hr) . Plasmids 
were isolated from 12 of the resulting colonies, and their 
restriction enzyme fragment length patterns and boundary 
base sequences were analyzed to select desired E. coli 
cells. Then, plasmids isolated from the selected E. coli 

15 were treated with BamHI and Sail, and electrophoreses on an 
agarose gel. The 4.2 kb fragment containing the promoter 
region and the presumed translation region of Rf-1 was 
separated, and recovered from the gel using QIAEXII 
(QIAGEN) . 

20 

On the other hand, TnosJH0072 (an intermediate vector 
comprising the nos terminator and a cassette of the 
ampicillin resistant gene) was treated with BamHI and Sail, 
and electrophored on a agarose gel. The 3.0 kb fragment 
25 containing the nos terminator and the ampicillin-resistant 
gene was separated, and was recovered from the gel using 
QIAEXII (QIAGEN). 
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The 4.2 kb fragment containing the promoter region and 
the presumed translation region of Rf-1, and the fragment 
derived from TnosJH0072 were subjected to ligation 
reaction, and to electroporation by the methods discussed 
5 above. The reactant was spread on LB plates containing 
ampicillin, and incubated (37°C, 16 hr) . Plasmids were 
isolated from 12 of the resulting colonies, and their 
restriction enzyme fragment length patterns and boundary 
base sequences were analyzed to select desired E. coli 
10 cells. 

Further, plasmids isolated from the selected E. coli 
were treated with Sgfl, and electrophoresed on an agarose 
gel . The 4.2 kb fragment containing the promoter region 

15 and the presumed translation region of Rf-1 was separated, 
and recovered from the gel using QIAEXII (QIAGEN) . The 
4.2 kb fragment and pSB200Pac (an intermediate vector 
comprising a cassette of the hygromycin-resistant gene) 
which has been treated with Pad and then with CIAP 

20 (TAKARA) were subjected to ligation reaction, and to 
electroporation by the methods discussed above. The 
reactant was spread on LB plates containing spectinomycin , 
and incubated (37°C, 16 hr) . Plasmids were isolated from 
16 of the resulting colonies, and their restriction enzyme 

25 fragment length patterns and boundary base sequences were 
analyzed to select desired E. coli cells. 



As a result of the above steps, E. coli cells were 
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obtained wherein the chimera gene of the fragment 
containing the promoter region of the Rf-1 and the presumed 
translation region of the Rf-1 attached with the nos 
terminator has been inserted within an intermediate vector. 
5 The E. coli cells were used for triparental mating with the 
Agrobacterium tumefaciens strain LBA4404/pSBl (Komari et 
al., 1996) and the helper E. coli strain HB101/pRK2013 
(Ditta et al. , 1980) according to the method of Ditta et 
al. (1980). Plasmids were isolated from 6 of the colonies 
10 formed on an AB plate containing spectinomycin and their 
restriction enzyme fragment length patterns were analyzed 
to select desired Agrobacterium cells. 

The Agrobacterium cells selected above were used to 
15 transform MS Koshihikari (having BT cytoplasm and a nucleus 
gene substantially identical to Koshihikari ) according to 
the method of Hiei et al. (1994). Necessary immature seeds 
of MS Koshihikari for transformation were prepared by 
pollinating MS Koshihikari with Koshihikari. 

20 

Transformed plants were transferred to a greenhouse 
under long-day conditions after acclimation. 32 
individuals grown to a stage suitable for transplantation 
were transplanted into l/5000a Wagner pots (4 
25 individuals /pot ) , and transferred into a greenhouse under 
short -day conditions 3-4 weeks after transplantation. 
About one month after heading, seed fertility was tested on 
standing plants. As a result, 28 individuals among the 32 
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transformed individuals restored fertility. 



By the above procedures, it has been experimentally 
demonstrated that the function of the Rf-1 gene can be 
5 furnished by expressing the presumed translation region. 

Example 17: I solation of cDNA 

In Example 15, the cDNA library derived from IL216 
young panicles was screened with Probe P and Probe Q. 
10 Plaques which are positive for both probes were isolated 
and analyzed, and 6 cDNA were isolated. In this example, 
similar screening was performed with Probe P and Probe R as 
mentioned below, and six additional cDNAs were isolated. 
Details are as follows. 

15 

Firstly, PCR was performed by using the following two 

types of primes : 

Sense primer 

5 ' -cagttgggttgaaacctaatactg-3 ' (SEQ ID NO: 86) 
20 Antisense primer 

5 ' -cactaaaccgttagacgagaaagc-3 ' (SEQ ID NO: 87) 

and a genomic clone of IR24, XSG16 as a template. SEQ ID 
NOS:86 and 87 correspond to the bases 45522-45545 and the 
bases 45955-45932 of SEQ ID NO: 27, respectively. 

25 

After the electrophoresis, the amplification product 
of about 430 bp was recovered from the agarose gel by QIAEX 
II (QIAGEN) . The recovered fragment was 32 P-labeld by 
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Rediprime II DNA labelling system (Amersham Pharmacia 
Biotech) (hereinafter referred as "Probe R" , Fig. 8). 

The cDNA library derived from IL216 young panicles was 
5 provided to prepare 20 of agar medium wherein about 15000 
plaques appeared. Plaque lift was performed twice for each 
agar medium, and the plaques were transferred to Hybond-N + 
(Amersham Pharmacia Biotech) . One membrane was used for 
hybridization with Probe P of Example 15, and the other 
10 membrane was used for hybridization with Probe R. All of 
the steps were performed according to the manufacture's 
instructions. As a result, 12 plaques were identified 
which proved to be positive for both Probe P and Probe R. 

15 Accordingly, those plaques were isolated, and 

subcloned into pBluescript according to the instructions of 
the manufacture ( Staratagene) . The terminal base sequences 
of the cones were determined. Among 12 clones, the 
terminal base sequences of 6 clones were identical to that 

20 of XSG16, and thus the entire base sequences of those 6 

clones were determined (#7 - #12). The results were shown 
in SEQ ID NOS: 80-85. 

All of the sequences, SEQ ID NOS: 80-85 are presumed to 
25 encode a protein having the amino acids 1-791 of SEQ ID 

NO: 75. Specifically, all and each of the 229-2601 of SEQ 
ID NO: 80, the bases 175-2547 of SEQ ID NO: 81, the bases 
227-2599 of SEQ ID NO:82, the bases 220-2592 of SEQ ID 
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NO: 83, the bases 174-2546 of SEQ ID NO: 84 and the bases 90- 
2462 of SEQ ID NO: 85 encodes a protein having amino acids 
1-791 of SEQ ID NO: 75. The above base sequences correspond 
to the bases 43907-46279 of SEQ ID NO:27. 

5 

The sequences of cDNAs isolated on this occasion were 
compared with the genome sequence of IR24 (SEQ ID NO: 27), 
and the structures of exons and introns were clarified 
(Fig. 8). Among the cDNAs isolated on this occasion, there 
10 are three cDNAs which do not have any exons irrelevant to 
the presumed translation region, and consist of a single 
exon (#10 - #12, SEA ID NOS : 83-85). 



- 152 - 



